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Method for the expression of genes in plants. 



_ A method for the expression of genes in plants, parts of 
plants, and plant celi cultures, in which a DNA fragment Is 
used comprising an inducible plant promoter of root nodule- 
specific genes, DNA-fragments comprising an inducible 
plant promoter, to be used when carrying out the method, 
said DNA-fragments being identical with, derived from or 
comprising a 5' flanking region of root nodule : specific 
genes of any origin as well as plasrriids and transformed 
Agrobacterlum rhizogenes-strain which can be used when 
carrying out the method. 
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A method for the expression of genes in plants, 
parts of plants, and plant cell cultures, and DNA 
fragments, plasmids . and transformed microorganisms 
to be used when carrying out the method, as veil 
5 as the use thereof for the expression of genes in 
plants, parts of plants, and plant cell cultures. 

The invention relates to a novel method for the 
expression of genes In plants, parts of plants, 
and plant cell cultures, as well as DNA fragments 
10 and plasmids comprising said DNA fragments to be 
used when carrying out the method. The invention 
furthermore relates to transformed plants, parts 
of plants and plant cells. 

The invention relates to this method for the ex- 
15 pression of genes of any origin under control of 
an inducible, root nodule specific promoter. 

The invention relates especially to this method 
for the expression of root nodule - specif ic genes 
in transformed plants including both leguminous 
20 plants and other plants. 

The invention relates furthermore to DNA fragments 
comprising an inducible plant promoter to be used 
when carrying out the method, as well as plasmids 
comprising said DNA fragments. 

25 In the specification i.a. the following terms are 
used: 

Root nodule - specif ic genes: Plant genes active 
only in the root nodules of leguminous plants, or 
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genes with an increased expression in root nodules. 
Root nodule - specif ic plant genes are expressed at 
predetermined stages of development and are ac- 
tivated in a coordinated manner during the symbiosis 
5 whereby a nitrogen fixation takes place and the 
fixed nitrogen is utilized in the metabolism of 
the plant. 

Inducible plant promoter: Generally is meant a 
promoter-active 5' flanking region from plant genes 

10 inducible from a low activity to a high activity. 
In relation to the present invention "inducible 
plant promoter" means a promoter, derived from, 
contained in or being identical with a 5' flanking 
region including a leader sequence of root nodule - 

15 specific genes and being capable of promoting and 
regulating the expression of a gene as characterised 
in relation to the present invention. 

Leader sequence: Generally is meant a DNA sequence 
being transcribed into a mRNA, but not further 

20 trans lated into protein. The leader sequence com- 
prises thus the DNA fragment from the start of the 
transcription to the ATG codon constituting the 
start of the translation. In relation to the present 
invention "leader sequence" means a short DNA frag- 

25 merit contained in the above inducible plant promoter 
and typically comprising 40-70 bp and which may 
comprise sequences being targets for a posttran- 
scriptional regulation. 

Promoter region: A DNA fragment containing a pro- 
30moter which comprises target sequences for RNA 
polymerase as well as possible activation regions 
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comprising target sequences for transcriptional 
effector substances. In the present invention, 
target sequences for transcriptional effectors may 
also be situated 3' to the promoter, i.e. in the 
5 coding sequences , the intervening sequences or on 
the 3' flanking region of a root nodule - specif ic 
gene . 

Furthermore a number of molecular-biological terms 
generally known to persons skilled in the art are 
loused, including the terms stated below: 

CAP (addition) site: The nucleotide of the 5' end 
of the transcript where 7-methylGTP is added; In 
the Figures often given also as an asterisk *~marked 
nucleotide on a given nucleotide sequence. 

15 DNA sequence or DNA segment: A linear array of 
nucleotides interconnected through phosphodiester 
bonds between the 3' and 5' carbon atoms of adjacent 
pento ses * 

Express ion : The process undergone by a structural 
20 g ene t0 produce a polypeptide. It is a combination 
of transcription and translation as well as possible 
posttranslational modifications. 

Flanking regions: DNA sequences surrounding coding 
regions. 5' flanking regions contain a promoter. 
25 3' flanking regions may contain a transcriptional 
terminator etc. 

Gene : A DNA sequence composed of three or four 
parts, viz. (1) the coding sequence for the gene 
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product, (2) the sequences in the promoter region 
which control whether or not the gene will be ex- 
pressed, (3) those sequences in the 3' end con- 
ditioning the transcriptional termination and op - 
5 tionally polyadenylation, as well as (4) interven- 
ing sequences, if any. 

Intervening sequences: DNA sequences within a gene 
which are not coding for any peptide fragment. The 
intervening sequences are transcribed Into pre-mRNA 
10 and are eliminated by modification of pre-mRNA 
into mRNA. They are also called introns . 

Chimeric gene: A gene composed of parts from various 
genes. E.g. the chimeric Lbc3 - 5 ' - 3 r - CAT is composed 
of a chloroamphenicolacetyltransf erase - coding se- 
15 quence deriving from E. coli and 5 ' and 3' flanking 
regulatory regions of the Lbc3 gene of soybean. 

Cloning : The process of obtaining a population of 
organisms or DNA sequences deriving from one such 
organism or sequence by asexual reproduction, or 
20more particular a process of isolating a particular 
organism or part thereof, and the propagation of 
this subfraction as a homogeneous population. 

Coding sequences: DNA sequences determining the 
amino acid sequence of a polypeptide. 

25 Cross - inoculation group: A group of leguminous 
plant species capable of producing functionally 
active root nodules with Rhizobium bacteria Isolated 
from root nodules of other species of the group. 
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Le ^hemoglobin (Lb) : An oxygen-binding protein ex- 
clusively synthesized in root nodules. The Lb pro- 
teins regulate the oxygen partial pressure in the 
root nodule tissue and transport oxygen to the 
5 bacteroides. In this manner the oxygen- sensitive 
nitrogenase enzyme is protected. The Lb genes are 
root nodule - spec if ic genes. 

Messenger-RNA (mRNA^ : RNA molecule produced by tran- 
scription of a gene and possibly modification of 
10 mRNA. The mRNA molecule mediates the genetic message 
determining the amino acid sequence of a polypeptide 
by part of the mRNA molecule being translated into 
said peptide. 

Downstream : A position in a DNA sequence. It is 
25 defined relative to the transcriptional direction 
5'- 3' of the gene relative to which the position 
is stated. The 3' flanking region is thus posi- 
tioned downstream of the gene. 

Nucleotide : A monomeric unit of DNA or RNA con- 
2o sisting of a sugar moiety (pentose), a phosphate, 
and a nitrogeneous heterocyclic base. The base is 
linked to the sugar moiety via a glycosidic bond 
(1' carbon of the pentose), and this combination 
of base and sugar is a nucleoside. The base cha- 
25 racterises the nucleotide. The four DNA bases are 
adenine (A), guanine (G) , cytosine (C) , and thymine 
(T) . The four RNA bases are A, G, C, and uracil (U) . 

Up s tream : A position in a DNA sequence. It is de- 
fined relative to the transcriptional direction 
30 5 ' - 3' of the gene relative to which the position 
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Is stated. The 5' flanking region is thus positioned 
upstream of this gene. 

Plant transformation: Processes leading to incor- 
poration of genes in the genome of plant cells in 
5 such a manner that these genes are reliably in- 
herited through mitosis and meiosis or in such a 
manner that these genes are only maintained for 
short periods. 

Plasmid: An extra -chromosomal double -stranded DNA 
lO sequence comprising an intact replicon such that 
the plasmid is replicated in a host cell. When the 
plasmid is placed within a unicellular organism, 
the characteristics of that organism are changed 
or transformed as a result of the DNA of the plas- 
15 mid. For instance a plasmid carrying the gene for 
tetracycline resistance (Tc^-) transforms a cell 
previously sensitive to tetracycline into one which 
is resistant to it. A cell transformed by a plasmid 
is called a transf ormant . 

20 Polypeptide : A linear array of amino acids inter- 
connected by means of peptide bonds between the 
oc- amino and carboxy groups of adjacent amino acids. 

Recombination : The creation of a new DNA molecule 
by combining DNA fragments of different origin. 

25 Homologous recombination: A recombination between 
sequences showing a high degree of homology. 

Replication: A process reproducing DNA molecules. 
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Replicon : A se If - replicating genetic element pos- 
sessing an origin for the initiation of DNA re- 
plication and genes specifying the functions neces- 
sary for a control and a replication thereof. 

5 Restriction fragment: A DNA fragment resulting from 
double - stranded cleavage by an enzyme recognizing 
a specific target DNA sequence. 

RNA polymerase: Enzyme effecting the transcription 
of DNA into RNA . 

10 Root nodule: Specialized tissue resulting from 
infection of mainly roots of leguminous plants 
with Rhizobium bacteria. The tissue is produced by 
the host plant and comprises therefore plant cells 
whereas the Rhizobium bacteria upon infection are 

15 surrounded by a plant cell membrane and differen- 
tiate into bacteroides. Root nodules are produced 
on other species of plants upon infection of nitro- 
gen-fixing bacteria not belonging to the Rhizobium 
genus. Root nodule - specif ic plant genes are also 

20 expressed in these nodules. 

Southern-hybridization: Denatured DNA is transferred 
upon size separation in agarose gel to a nitro- 
cellulose membrane. Transferred DNA is analysed 
for a predetermined DNA sequence or a predetermined 

25 gene by hybridization. This process allows a binding 
of s ingle - s tranded , radioac tively marked DNA se- 
quences (probes) to complementary single - stranded 
DNA sequences bound on the membrane. The position 
of DNA fragments on the membrane binding the probe 

3Qcan subsequently be detected on an X-ray film. 
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Symbiotic nitrogen fixation: The relationship where- 
by bacteroides of root nodules convert the nitrogen 
(dinitrogen) of the air into ammonium utilized by 
the plant while the plant provides the bacteroides 
5 with carbon compounds as a carbon source . 

Symbiont : One part of a symbiotic relationship, and 
especially Rhizobium is called the microsymbiont . 

Transformation : The process whereby a cell is incor- 
porating a DNA molecule. 

lO Translation: The process of producing a polypeptide 
from mRNA or: 

the process whereby the genetic information present 
in a mRNA molecule directs the order of specific 
amino acids during the synthes is of a polypeptide . 

^5 Transcription: The method of synthesizing a com- 
plementary RNA sequence from a DNA sequence. 

Vector : A plasmid, phage DNA or other DNA sequences 
capable of replication in a host cell and having 
one or a small number of endonuclease recognition 
2o sites at which such DNA sequences may be cleaved 
in a determinable manner without loss of an es- 
sential biological function. 

Traditional plant breeding is based on repeated 
crossbreeding of plant lines individually carrying 
25 desired qualities. The identification of progeny 
lines carrying all the desired qualities is a par- 
ticularly time-consuming process as the biochemical 



and genetic basis of the qualities is usually un- 
known. New lines are therefore chosen according to 
their phenotype, usually after a screening of many 
lines In field experiments. 

5 Through the ages a direct connection has existed 
* between the state of nutrition, i.e. the health, 
of the population and the agricultural possibility 
of ensuring a sufficient supply of assimilable 
nitrogen in order to obtain satisfactory yields. 

10 Already in the seventeenth century it was discovered 
that plants of the family leguminosae including 
beyond peas also beans, lupins, soybean, bird' s-foot 
trefoil, vetches, alfalfa, sainfoin, and trefoil had 
an ability of improving crops grown on the habitat 

15 of these plants. Today it is known that the latter 
is due to the fact that the members of the plants 
of the family leguminosae are able to produce nitro- 
gen reserves themselves. On the roots they carry 
bacteria with which they live in symbiosis. 

20 An infection of the roots of these leguminous plants 
with Rhiz ob ium bacteria causes a formation of root 
nodules able to convert atmospheric nitrogen into 
bound nitrogen, which is a process called nitrogen 
fixation . 

25 Atmospheric nitrogen is thereby converted into forms 
which can be utilized by the host plant as well as 
by the plants later on growing on the same habitat. 

In the nineteenth century the above possibility was 
utilized for the supply of nitrogen in order to 
30 achieve a novel increase of the crop yield. 
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The later further increases in the yield have, how- 
ever, especially been obtained by means of natural 
fertilizers and nitrogen- containing synthetic fer- 
tilizers. The resulting pollution of the environ- 
5 ment makes it desirable to provide alternative 
possibilities of ensuring the supply of nitrogen 
necessary for the best possible yields obtainable. 

It would thus be valuable to make an improvement 
possible of the existing nitrogen fixation systems 
lO in leguminous plants as well as to allow an in- 
corporation of nitrogen fixation systems in other 
plants . 

The recombinant DNA technique and the plant trans- 
formation systems developed render it now possible 

15 to provide plants with new qualities in a well - 
controlled manner. These characteristics can derive 
from not only the same plant species, but also 
from all other prokaryotic or eukaryotic organisms. 
The DNA techniques allow further a quick and spe- 

20 cif ic identification of progeny lines carrying the 
desired qualities. In this manner a specific plant 
line can be provided with one or more desired qual- 
ities in a quick and well-defined manner. 

Correspondingly, plant cells can be provided with 
25 well defined qualities and subsequently be main- 
tained as plant cell lines by means of known tissue 
culture methods. Such plant cells can be utilized 
for the production of chemical and biological prod- 
ucts of particular interest such as dyes, flavours, 
30 aroma components, plant hormones, pharmaceutical 
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products , primary and secondary metabolites as 
well as polypeptides (enzymes). 

A range of factors and functions necessary for 
biological production of a predetermined gene pro- 
5 duct are known. Both the initiation and regulation 
of transcription as well as the initiation and 
regulation of pos t trans scrip tional processes can 
be characterised. 

At the gene level it is known that these functions 
lO are mainly carried out by 5 f flanking regions. A 
wide range of 5 ' flanking regions from prokaryotic 
and eukaryotic genes has been sequenced, and in 
view inter alia thereof a comprehensive knowledge 
has been provided of the regulation of gene ex- 
15 pression and of the sub-regions and sequences being 
of importance for the regulation of expression of 
the gene. Great differences exist in the regulatory 
mechanism of prokaryotic and eukaryotic organisms, 
but many common features apply to the two groups. 

20 The regulation of the expression of gene may take 
place on the trans scrip tional level and is then 
preferably exerted by regulating the initiation 
frequency of trans scrip t ion . The latter is well- 
known and described inter alia by Benjamin Lewin, 

25 Gene Expression, John Wiley & Sons, vol. I, 1974, 
vol. II, Second Edition 1980, vol. Ill, 1977. As 
an alternative the regulation may be exerted at 
the pos t trans scr ip tional level, e.g. by the re- 
gulation of the frequency of the translation ini- 

30 tiation, at the rate of the translation, and of 
the termination of the translation. 
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The present invention is based on the surprising 
finding that 5' flanking regions of root nodule- 
specific genes, exemplified by the 5' flanking 
region of the soybean leghemoglob in Lbc3 gene, can 
5 be used for inducible expression of a foreign gene 
in an alien leguminous plant. The induction and 
regulation of the promoter is preferably carried 
out in the form of a regulation and induction at 
the transscriptional level and differs thereby 
lO from the inducability stated in Patent Application 
No. 86114704. 9 r the latter inducability preferably 
being carried out at the translation level. 

The transscription of both the Lbc3 gene of the 
soybean and of a chimeric Lbc3 gene transferred to 

15 bird' s-foot trefoil starts at a low level immediate- 
ly upon the appearance of the root nodules on the 
plant roots. Subsequently, a high increase of the 
transscription takes place immediately before the 
root nodules turn red. The transcription of a range 

20 of other root nodule - specif ic genes is initiated 
exactly at this time. The simultaneous induction 
of the transscription of the Lb genes and other 
root nodule - specif ic genes means that a common DNA 
sequence(s) must be present for the various genes 

25 controlling this pattern of expression. Thus the 
leghemoglob in-C3 gene is a representative of one 
class of genes and the promoter and the leader 
sequence, target areas for activation as well as 
the control elements of the organ specificity of 

30 the Lbc3 gene are representatives of the control 
elements of a complete gene class. 



. ; 02496 



13 

The promoter of the 5' flanking regions of the Lb 
genes functions In soybeans and is responsible for 
the transcription of the Lb genes in root nodules. 
It is furthermore known, that the efficiency of 
5 both the transcription initiation and the subsequent 
translation initation on the leader sequence of the 
Lb genes is high as the Lb proteins constitute ap- 
proximately 20% of the total protein content in 
root nodules . 

IQThe sequence of 5' flanking regions of the four 
soybean leghemoglobin genes Lba, Lbc lf Lbc 2 , and 
Lbc 3 appears from the enclosed sequence scheme, 
scheme 1 F wherein the sequences are stated In such 
.a manner that the homology between the four 5' 

15 flanking regions appears clearly. 

In the sequence scheme Indicates that no base 

is present in the position in question. The names 
of the genes and the base position counted upstream 
from the ATG start codon are indicated to the right 
20 of the sequence scheme. Furthermore the important 
sequences have been underlined. 

As it appears from the sequence scheme a distinct 
degree of homology exists between the four 5' flan- 
king regions, and in the position 23-24 bp upstream 

25 from the CAP addition site they all contain a 
TATATAAA sequence corresponding to the "TATA" box 
which In eukaryotic cells usually are located a 
corresponding number of bp upstream from the CAP 
addition site. Furthermore a CCAAG sequence is 

30 present 64-72 bp upstream from the CAP addition 
site, said s quence corresponding to the "CCAAT" 
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box usually located 70-90 bp upstream from the CAP 
addition site. From the CAP addition site to the 
translation start codon, ATG , leader sequences of 
52-59 bp are present and show a distinct degree of 
5 homology of approx. 75-80%. 

In accordance with the present invention it has 
furthermore been proved, exemplified by the Lbc3 
gene, that the 5' flanking regions of the soybean 
leghemoglobin genes are functionally active in 

10 other plant species. The latter has been proved by 
fusioning the E. coli chloramphenicol acetyl trans- 
ferase (CAT) gene with the 5' and 3' flanking re- 
gions of the soybean Lbc3 gene in such a manner 
that the expression of the CAT gene is controlled 

25 by the Lb promoter. This fusion fragment was cloned 
into the integration vectors pARl and pAR22, where- 
by the plasmids pAR29 and pAR30 were produced. 
Through homologous recombination the latter plasmids 
were integrated into the Agrobacterium rhizogenes 

2o T DNA region. The transformation of Lotus cornicu- 
latus (bird's-foot trefoil) plants, i.e. transfer 
of the T DNA region, was obtained by wound infection 
on the hypokotyl. Roots developed from the trans- 
formed plant cells were cultivated in vitro and 
25 freed from A. rhizogenes bacteria by means of anti- 
biotics. Completely regenerated plants were produced 
by these root cultures in a conventional manner 
through somatic embryogenes is or organogenesis. 

Regenerated plants were subsequently inoculated 
30 with Rhizobium loti bacteria and root nodules for 
analysis were harvested. Transcription and trans- 
lation of the chimeric Lbc3 CAT gene could subse- 
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quently be detected in root nodules on transformed 
plants as the activity of the produced chloram- 
phenicol acetyl transferase enzyme. 

The conclusion can subsequently be made that the 
5 promoter-containing 5' flanking regions of root 
nodule - specif ic genes exemplified by the soybean 
Lbc3 promoter are functionally active in foreign 
plants. The latter is a surprising observation as 
root nodules are only developed as a consequence 
10 of a very specific interaction between the legu- 
minous plant and its corresponding Rhlz obium micro- 
symbiont. 

Soybeans produce nodules only upon infection by the 
species Rhizob ium i aponicum and Lotus cornicula tus 

15 only upon infection by the species Rhizob ium lo t i . 
Soybean and Lotus cornicula tus belong therefore to 
two different cross - inoculation groups, each group 
producing root nodules by means of two different 
Rhizob ium species. The expression of a chimeric 

20 soybean gene in Lotus cornicula tus proves therefore 
an unexpected universal regulatory system applying 
to the expression of root nodule - specif ic genes. 
The regulatory DNA sequences involved can be placed 
on the 5' and 3' flanking regions of the genes, 

25 here exemplified by the 2.0 Kb 5' and 0.9 Kb 3' 
flanking regions of the Lbc3 gene. This surprising 
observation allows the use of root nodule - specif ic 
promoters and regulatory sequences in any other 
plant species and any other plant cell line. 

30 In other experiments the 5' flanking region of the 
nodule - spec 1 fic N23 gene was fused to the CAT g ne 
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and the Lbc3 3' flanking region in such a manner 
that: the expression of the CAT gene is controlled 
by the N23 promoter. This fusion fragment was cloned 
into the integration vector pAR22 producing the 
5 plasmid N23-CAT which was subsequently recombined 
into A . rhizogenes and transferred to Lotus corni- 
culatus and Trif olium re-pens (white clover) by the 
previously described method. The root nodule-spe- 
cific expression of the transferred N23-CAT gene 

Ijq obtained in L. corniculatus infected with Rhizobium 
loti and in T . renens infected with Rhizobium tri- 
f olii further demonstrated that expression of root 
nodule - specif ic genes is independent of the plant 
species and Rhizobium species. A universal regu- 

25 latory system therefore regulates the expression 
of root nodule- specif ic genes in the different 
symbiotic systems formed between legumes and the 
Rhizobium species of the various cross - inoculation 
groups . 

20lt is known from European Patent Application EP 
122, 791. Al that plant genes from one species, by 
Agrobacterium mediated transformation, can be trans- 
ferred into a different plant species. It is also 
known from EP 122, 791. Al that a transferred gene 

25 encoding the seed storage protein "Phaseolin" can 
be expressed into tobacco and alfalfa. From the 
literature it is also known that this expression 
is seed specific ( Sengupta-Gopalan et al . 1985, 
Proc. Natl. Acad. Sci. 82, 33203324). 

30The present invention therefore relates to a novel 
method for the expression of transferred genes in 
a root nodule- specif ic manner . using DNA regulatory 



0249676 



17 

sequences from the 5' promoter region, the coding 
region, or the 3' flanking region of root nodule- 
specific genes, here exemplified by the leghemoglo- 
bin Lbc 3 gene and the N23 gene. This method is 
5 distinct from both the method of Agrobac ter ium 
mediated transformation and expression of the seed 
storage protein phaseolin gene characterised in EP 
122, 791. Al. Expression of the transferred phaseolin 
gene in EP 122, 791. Al only demonstrates that the 

10 phaseolin gene family with its particular regulatory 
requirements can be expressed in tobacco and alfal- 
fa. It does not demonstrate nor predict that any 
other genes with their particular regulatory re- 
quirements can be expressed in any other plants or 

23 plant tissue. 

An object of the present invention is to provide a 
possibility of expressing desired genes in plants, 
parts of plants, and plant cell cultures. 

A further object of the invention is to render it 
20 possible to express genes of any origin by the 
control of an inducible root nodule - specif ic pro- 
moter . 

A particular object of the invention is to provide 
a possibility of expressing desired genes in legu- 
25 minous plants . 

A still further particular object of the invention 
is to provide a possibility of expressing root 
nodule-specific genes in non- leguminous plants. 



Further objects of the invention are to improve the 
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existing nitrogen- fixing systems in leguminous 
plants as well as to incorporate nitrogen-fixing 
systems in other plants. 

A further object of the invention is to provide a 
5 possibility of in certain cases allowing the use 
of specific sequences of the 3' flanking region, 
of the coding sequence, and of intervening sequences 
to influence the regulation of the root nodule - 
specific promoter. 

10 Furthermore it is an ob j ect of the invention to 
provide plasmids comprising the above mentioned 
inducible plant promoter. 



Further objects of the invention appear immediately 
from the following description. 

15 The method according to the invention for the ex- 
pression of genes in plants, parts of plants, and 
plant cell cultures is carried out by introducing 
into a cell thereof a recombinant DNA segment con- 
taining both the gene to be expressed and a 5' 

20 flanking region comprising a promoter sequence, and 
optionally a 3 f flanking region, and culturing of 
the transformed cells in a growth medium, said 
method being characterised by using as the recom- 
binant DNA segment a DNA fragment comprising an 

25 inducible plant promoter (as defined) from root 
nodule - specif ic genes. If desired the transformed 
cells are regenerated to plants. 



The method according to the invention allows in a 
well defined manner an expression of foreign genes 
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in plants, parts of plants, and plant cell cultures, 
in this connection especially genes providing the 
plants with desired properties such as for instance 
a resistance to plant diseases and increased content 
5 of valuable polypeptides. 

A further use is the preparation of valuable pro- 
ducts such as for instance dyes, flavourings, plant 
hormones , pharmaceutical products , primary and 
secondary metabolites, and polypeptides by means 
1q of the method according to the invention in plant 
cell cultures and plants. 

By using the method according to the invention for 
the expression of root nodule - specif ic genes it is 
possible' to express root nodule - specif ic genes 

•^5 necessary for the formation of an active nitrogen- 
fixing system both in leguminous plants and other 
plants. The correct developmental control, cf. 
Example 8, allows the establishment of a symbiotic 
ni trogen - f ixing system in non- leguminous plants. In 

20 this manner it is surprisingly possible to improve 
the existing nitrogen- fixing systems in leguminous 
plants as well as to incorporate nitrogen- fixing 
systems in other plants. 

The use of the method according to the invention 
25 for the expression of foreign genes in root nodules 
renders it possible to provide leguminous plants 
with improved properties such as resistance to 
herbicides and resistance to diseases and pest. 

Acc rding to a particular embodiment of th method 
3Qacc rding t the invention a DNA fragment is used 
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which comprises an inducible plant promoter and 
which is identical with, derived from, or comprises 
5' flanking regions of leghembglobins genes. In this 
manner the expression of - any gene is obtained. 

5 Examples of such DNA fragments are DNA fragments 
of the four 5* flanking regions of the soybean 
leghemoglobin genes, viz. 

Lb a with the sequence: 

10 GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 
GATATATACC TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 
TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 
ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 
GGTTAAATCT CATAGTGCCT CTATTCAATA ATTTGGGCTC AATTTAATTA 
GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 
AAAGTTGGTT TTTCTCGAGG A AG A AAG G A A ATGTTAAAAA CTGTG ATATT 

15 TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 
CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTGAT 
TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 
TCATCATGCT GATTGACACC CTCCACAAGC CAAGA GAAAC ACATAAGCTT 
TGGTTTTCTC ACTCTCCAAG CCCT CTATAT AAA CAAATAT TGGAGTGAAG 
TTGTTGCATA ACTTGCATCG AACAATTAAT AGAAATAACA GAAAATTAAA 
AAAGAAATAT G, 

2o Loci with the sequence: 



TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 

AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 

CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 

TTTCCAAACC TGTAGATTTA TTTATTT ATT TATTTATTTT TACAAAGGAG 

ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 

TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 

25 TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 

TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 

GAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 

TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 

AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 

TAGGATTTTG AAA AG AT CAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 

CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 

ATATAAACAT GTATTGGATG TGAAGTTATT GCATAACTTG CATTGAACAA 

30 TAGAAAATAA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG , 
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Lbc£ with the sequence: 

TCGAGTTTTT ACTGAACATA CATTTATTAA AAAAAACTCT CTAGTGTCCA 

TTTATTCGGC GAGAAGCCTT CTCGTGCTTT ACACACTTTA ATATTATTAT 

ATCCCCACCC CC AC C A A AAA AAAAAAAACT GTTATATCTT TCCAGTACAT 

TTATTTCTTA TTTTTACAAA GGAAACTTCA CGAAAGTAAT TACAAAAAAG 

q ATAGTGAACA TCATTTTTTT AGTTAAGATG AATTTTAAAA TCACACTTTT 

TTATATTTTT TTGTTACCCT TTTCATTATT GGGTGAAATC TCATAGTGAA 

AC^ATTAAAT AGTTTGGGCT CAAGTTTTAT TAGTAAAGTC TGCATGAAAT 

TTAACTTAAT AATAGAGAGA GTTTTGGAAA GGTAACGAAT GTTAGAAAGT 

GTGATA'T'TAT TATAGTTTTA TTTAGATTAA TAATTATGTT TACATGAAAA 

TTGACAATTT ATTTTTAAAA TTCAGAGTAA TACTTAAATT ACTTATTTAC 

TTTAAGATTT TGAAAAGATC ATTTGGCTCT TCATCATGCC GATTGACACC 

CTCCACAAGC CAAG AGAAAC TTAAGTTGTA ATTTTTCTAA CTCCAAGCCT 

ioi TCTVTATAAA CACGTATTGG ATGTGAAGTT GTTGCATAAC TTGCATTGAA 

CAATAGAAAT AACAACAAAG AAAATAAGTG AAAAAAGAAA TATG, 



and Lbc3 with, the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 

15 GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 1 
TTGTTE"TAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 

-iry AATTTTAACT TAAAAATAGA GAAAATC TGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
CTAAAAAAAT ATATATT A A A ATTTTAAATT CAGAATAATA CTT A A ATT AT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 

?5 CAGAAAAGTA GAAAAGAAAT ATG* 

A further embodiment of the method according to 
the invention uses a DNA fragment identical with, 
derived from or comprising 5' flanking regions of 
the Lbc 3 - 5 ' - 3 ' - CAT gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 

GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 

GTAGATT^VT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

AT A A A A AT AG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 

TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGx^TCA 
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TATAAAAAAA ATT GTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTCA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT- AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAG GG ACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
5 TTATTT ACT G AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAG CCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTCTATA T AA AT A AG T A 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA G AATTCT AAA ATG 

10 A still further preferred embodiment of the method 
according to the invention uses a DNA fragment 
identical with, derived from or comprising 5' flank- 
ing regions of the N23 gene with the sequence 



lO 20 30 40 50 60 70 

GWCTTCGAGCTCGCCCGGGGATCGATCCTCTA 
Sail 



So 90 lOO 110 120 130 140 

15 TTCTATTGAGACACGATTTGAACAATTTTTACAOTAT^ 

aaatctaaaSctotagatgat^ 

220 230 240 250 260 270 280 

atgaatgctatgatattgatggtcttgatntattnncagaact 

290 300 310 320 330 340 350 

AGAAGTTAGCACACCAATAGAAGTATTGAGTTATATTAAAACTTT^ 

360 370 3BO 390 400 410 420 

CATATAGAATTTTATTGACAATCCTTA^ 

430 440 450 460 470 480 490 

20 ACTTAAATCATATCTAAAATCAACAAT GTTACAAGATAGATT GAATGAGTTAGTTATTTT ATCTATTGAA 

500 510 520 530 540 550 560 

AGTAAAGTGTTAGAATTGTTTGATTATAAAACTCTGATAAATGATTTTGCAGTTAAAA 

570 580 590 60O. .610 620 630 

TAATATAAAAATTGATATTTTATATAATATATTAAGTCTCTTTAAAAW 

640 6SO 660 670 6SO 690 700 

AAATAATAAAAXAAAGCAACTCTTAAOrTTTAATGAAACATC^ 

710 720 730 740 750 760 770 

AAAAATTAATGCTTGATGGAAGTTTTT AATTTGTTCTACT CAATACT CAAAGG GTTGT AAAT ATTTTTTT 

780 790 800 810 820 830 840 

P5 TATCATTTATATGlrrGTAAATATGAATGCACTAGTAATTAGTTTAATGATAAAATATAT^ 
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850 860 670 680 890 900 9lO 

ATTTCTGTCTCTTGGGAACTCGTGAGAATTGAATATATTATAAAGATGAAAGGTCGT^ 

920 930 940 950 960 970 980 

AGAATAAATATTTATATACAATTCCTAGATTTTGTTATAAAATTCACATATTGTATGAGTATA^ 

990 1000 IO10 1020 1030 1040 1050 

GAGCACACACCAAACTAGTCTCAAATTAACTAAGGTGCTAATTATTAGCGGCTAGCTAAGTAACCAAGTA 

Ddel 

ATTAATG 



5 In a particularly preferred embodiment of the method 
according to the invention a 3' flanking region of 
root nodule - spec if ic genes is furthermore used, in 
particular sequences of the 3' flanking region ca- 
pable of influencing the activity or regulation of 
lO a promotor of the root nodule - specif ic genes or 
the transcription termination, or capable of in- 
fluencing the yield of the desired gene product in 
another manner. 

Examples of such 3' flanking regions are the four 
15 3* flanking regions of the soybean leghemoglobin 
genes , viz . 

Lba with the sequence: 



1590 162o 
TAA TTA GTA TCT" ATT GCA GTA AAG TGT AAT AAA TAA ATC TTG 



1650 1680 
TTT CAC TAX AAA ACT TGT TAC TAT TAG ACA AGG GCC TGA TAC AAA ATG TTG GTT AAA ATA 

1710 1740 
20 ATG GAA TTA TAT AGT ATT GGA TAA AAA TCT TAA GGT TAA TAT TCT ATA TTT GCG TAG GTT 

1770 1600 
TAT GCT TGT GAA TCA TTA TCG GTA TTT TTT TTC CTT TCT GAT AAT TAA TCG GTA AAT TA 

1830 I860 
ACA AAT AAG TTC AAA ATG ATT TAT ATG TTT CAA AAT TAT TTT AAC AGC AGG TAA AAT GTT 



ATT TGG TAC GAA AGC TAA TTC GTC GA 



0249676 



24 



LbC}_ with the sequence : 



1320 

TAA/TT AGG ATC TAG TGC ATT GCC GTA 



1350 1380 
AAG TGT AAT AAA TAA ATC TTG TTT CAA CTA AAA CTT GTT ATT AAA CAA GTT CCC TAT ATA 

1410 1440 
AAT GTT GTT TAA AAT AAG TAA ATT TCA TTG TAT TGG ATA AAC ACT TTT AAG TTA TAT ATT 

1470 150O 
5 TCC ATA TAT TTA CGT TTG TGA ATC ATA ATC GAT ACT TTA TAA AAA TAA ATT CCA AAT AAT 



TTA TAC GTT TTA AAA ATT ATT TT 



Lbc2 with the sequence: 



TAG/GAT CTA CTA TTG CCG TCA AGT 

1140 

GTA ATA AAT AAA TTT TGT TTC ACT AAA ACT TGT TAT TAA ACA AGT CCC CGA TAT ATA AAT 

1170 12 OO 

If) GTT GGT TAA AAT AAG TAA ATT ATA CGG TAT TGA TAA ACA ATC TTA AGT TTT ATA TAT AGT 
*^ 1230 1260 

TCC ATA TAC TAA AGT TTG TGA ATC ATA ATC GA 

1290 



and Lbc3 with the sequence: 



TAG/GAT CTA CAA TTG CCT TAA AGT GTA ATA AAT AAA 
990 1020 

TAT TAT TTC ACT AAA ACT TGT TAT TAA ACC AAG TTC TCG ATA TAA ATG TTG GTT AAA CTA 

1050 lOSO 



1 c AGT AAA TTA TAT GGT ATT GGA TAA ACA ATC TTA AGC TT 

1110 



This sequence is positioned on the 0.9 Kb 3' flan- 
king region used according to the invention. A 
particular embodiment of the invention is therefore 
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the use of sequences of this region exerting or 
mediating the regulation characterised by the in- 
vention of root nodule - specif ic promoter regions. 

In a preferred embodiment of the method according 
5 to the invention a region is used of the coding 
sequence or intervening sequence of root nodule- 
specific genes , in particular sequences of the 
coding sequence or the intervening sequence capable 
of influencing the regulation of a promotor of the 
10 root nodule - specif ic genes or capable of influenc- 
ing the yield of the desired gene product in another 
manner . 



Examples of such coding sequences and intervening 
sequences are the four leghemoglobin genes of soy- 
15 bean , viz . 



Lb a with the sequence: 120 

VAL 

ATG/GTT 

150 ISO 

ALA PHE THR GLU LYS GLN ASF ALA LEU VAL SER SER SER PHE GLU ALA PHE LYS ALA ASN 
GCT TTC ACT GAG AAG CAA GAT GCT TTG GTG AGT AGC TCA TTC GAA GCA TTC AAG GCA AAC 

210 240 

ILE PRO GLN TYR SER VAL VAL PHE TYR THR SER 

ATT CCT CAA TAC AGC GTT GTG TTC TAC ACT TC/G TAA GTT TTC TCT CTA AGC ATG TGT CTT 

270 300 
20 CCA TTC TAT GTT TTT CTT TTG GAA ATT TGT TGT GTT TGA AAA AAG ATA TAT TGT TAA TGT 

330 360 

ILE LEU GLU LYS ALA PRO ALA ALA LYS ASP 
GAG TGG TTT TGG TTT GAT TAA AAA TGA ATAG/G ATA CTG GAG AAA GCA CCT GCA GCA AAG GAC 

390 420 
LEU PHE SER PHE LEU ALA ASN GLY VAL ASP PRO THR ASN PRO LYS LEU THR GLY HIS ALA 
TTG TTC TCA TTT CTA GCA AAT GGA GTA GAC CCC ACT AAT. CCT AAC CTC ACG GGC CAT GCT 

450 480 

GLU LYS LEU PHE ALA LEU 

GAA AAG CTT TTT GCA TTG/GTAA GTA TCA CCC AAC TAA AAT TAT AAC TAT TTT ATG TGA 

5IO 540 
TTA ATT TTA AGA TTA AGC ATC ATG TAT TTT AAC . ACT CTT AAA ACA TCA ATG AAC ATT AAT 

570 600 
25 TGT TTG AAT TGT ATT TTA TAT TTT TGC CAT ATC TTG AAC TAG GAA TAG TAT ATA AAT TTC 

630 660 
TAT TAG TAT TTG TTG ATA ATT ATT TTT CTT TCA TAA CTA TCT TGT CAC ATA TTA TAT ATT 
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690 720 
VAL ARG ASP SER ALA GLY GLN LEU LYS ALA SER GLY THR VAL VAIr ALA 
TTT TGA ATT GTAG/GTG CGT GAC TCA GCT GGT CAA CTT AAA GCA AGT GGA ACA GTG GTG GCT 

750 780 
ASP ALA ALA LEU GLY SER VAL HIS ALA GLH LYS ALA VAL THR ASP PRO GLN PHE VAL 
GAT GCC GCA CTT GGT TCT GTT CAT GCC CAA AAA GCA GTC ACT GAT CCT CAG TTC GTG/GT 

810 840 
ATG ATA AAT AAT GAA ATG TTA TAA TAA ATT ATG CAT ACT TCA ATT TTT CAT GGA GCA GTA 

870 90O 
TAA TGA TCA ACA CAC ACT TCT TTT GTT TCA TGC ATT TGA TAA CTA CAA TCT TAA AAT GTT 

930 9SO 
5 GCA ATC TTA AAA ATA GTA TTA AAA ATA TAA CAT TTA ATT AGC TCA TCA ATA TTT TTC TGT 

990 1020 
TGC AAT TTT TTA TGA AAA AAT TAT AAT TAT GAA TTC TTT GAG CAA TGT TTA ATT AAA AAA 

1050 lOSO 
TTG ATT TAA TAA TGA AAT AAC TAA GCT ACC TCT GTC TCG TTT TTC ATT TAA ACT ATG ACA 

1110 1140 
TAA ACA ATG AAT AAA GTA AAC TAA ACC ATG ACA TGT TTA TTT TTG AAT GAG GTT ATT AAT 

1170 1200 
AAT TTT TTT TCA CTA TCT ATT GCA ATG TTC ATT GAT TAT CAA TTA TCT TGG TTG CAT TGA 

1230 12€Q 
j[_Q TTC TCT CGA TTT TTT TCT TGA GGT TAA GCT TCA GTT CAA TAT ATA TTC. ATT TTT TGA TAA 

1290 1320 
AAA AAA ATA GTA CAA TAT ATT TTC ATT TAG CTG ATC ATA TTT ATT TAA GTT CAA CTT AAA 

13 SO 13SO 
ATT TTA TAG ATG TTA ATT GAT ATA ATT TGT TGA GAT GAT GAG AAG ACC AAT ACC ATT ACG 

1410 14 40 

TAC TCT TTT GAA AGT GTT ATA TGG ATT TTA ATT ATA AGG.AAA AAT GTA AGA GCT AAA CCA 

14 70 15 OO 

VAL VAL LYS GLU ALA LEU LEU LYS THR ILE LYS ALA ALA VAL 
1 g TTG CTG ATG ATT TTG AAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA ATA AAG GCA GCA GTT 

1530 1S60 
GLY ASP LYS TRP SER ASP GLU LEU SER ARG ALA TRP GLU VAL ALA TYR ASP GLU LEU ALA 
GGG GAC AAA TGG AGT GAC GAG TTG AGC CGT GCT TGG GAA GTA GCC TAC GAT GAA TTG GCA 



ALA ALA ILE LYS LYS ALA 
GCA GCT ATT AAG AAG GCA TAA 



The amino acid sequence of the Lba protein is in- 
20d*- cate d- above the coding sequence, 



Lbc^ with the sequence 
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160 
GLY 
ATG/GCT 

210 240 
ALA PHE THR GLU LYS GLN GLU ALA LEU VAL SER SER SER PHE GLU ALA PHE LVS ALA ASH 
GCT TTC ACT GAG AAG CAA GAG GCT TTG GTG AGT AGC TCA TTC GAA GCA TTC AAG GCA AAC 

270 300 
ILE PRO GLN TYR SER VAL VAL PHE TYR ASN SER 

ATT CCT CAA TAC AGC GTT GTG TTC TAC AAT TC/GTAA GTT TTC TCT ATA AGC ATG TGT CTT 

330 360 
TCA TTC -TAT GTT TTT CTT CTG GAA ATT TTT TGT GTT TGA AAA AAG ATA TAT ATA TAT ATA 

390 420 
5 TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA TAT TTT GTT AAT GTG AGT GGT TTT 

450 480 

ILE LEU GLU LYS ALA PRO ALA ALA LYS ASP LEU PHE SER 
GGT TTG ATT AAA AAT AAA TAG/GATT CTG GAG AAA GCA CCT GCA GCA AAG GAC TTG TTC TCA 

S10 540 
PHE LEU ALA ASM GLY VAL ASP PRO THR ASN PRO LYS LEU THR GLY HIS ALA GLU LYS LEU 
TTT CTA GCA AAT GGA GTA GAC CCC ACT AAT CCT AAG CTC ACG GGC CAT GCT GAA AAG CTT 

570 600 

PHE ALA LEU 

TTT GCA TTG/GT AAG TAT CAG CCA ACT AAA ATT ATA ACT ATT TTA TGT GAT TAA TTT TAA 

630 660 
GAT TAA ACA TCA TGT ATT TTA ACA CTC TTA AAA TAT CAA TGA ACA TTA ATT TTT TGA ATT 

690 720 
10 GTA TTT TAT ATT TTT ACC ATA TCT TGA ACT AGG AAT AAT ATA TAA ATT TCT ATT AGT ATT 

750 780 
TGT TGG TAA TTA CAT ATA TAT ATA TAT ATA TAA TCC TTG TGA TAA TTA TTT TTC GAA TTT 

BIO 840 
VAL ARG ASP SER ALA GLY GLN LEU LYS THR ASN GLY THR VAL VAL ALA ASP ALA ALA 
GTAG/GTG CGT GAC TCA GCT GGT CAA CTT AAA ACA AAT GGA ACA GTG GTG GCT GAT GCT GCA 

870 900 

LEU VAL SER ILE HIS ALA GLN LYS ALA VAL THR ASP PRO GLN PHE VAL 

CTT GTT TCT ATC CAT GCC CAA AAA GCA GTC ACT GAT CCT CAG TTC GTG/GT ATG ATA AAT 

930 96b 
AAT ACT AGT AAA ATG TTA CAA TAA ATG CAA ACT TAA GTT TTA CGT ACA TAG TGA TCA TGA 

990 1O20 
15 CTT CAT GCA TGG CTA TTA TTT TTT CAT ATT TAT TGA AGT CAA CTT AAA ATT TTG TAA ATA 

1050 loeo 

CAG ATC GAT GCT AGT AAT TTG TTG AGA TCA TGA GAA AAC GTA CCA CTA CTC CAA TAG CAT 

1110 1140 
TAC TCA TTT TGA AAA TTG TAT AAC TGT GAT CTA ATT ATA AGG AAA AAG TGT ATA TAA GAS 

1170 1200 
VAL VAL LYS * GLU ALA LEU LEU LYS THR 
CTA ATC CAT TAT TAA TGT TTT TTA TAT TTT GTAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA 

12 30 1260 
ILE LYS GLU ALA VAL GLY GLY ASN TRP SER ASP CLU LEU SER SER ALA TRP GLU VAL ALA 
ATA AAG GAA GCT GTT GGC GGC AAT TGG AGT GAC GAA TTG AGC AGT GCT TGG GAA GTA GCC 

1290 

TYR ASP GLU LEU ALA ALA ALA ILE LYS LYS ALA 
20 TAT GAT TTG GCA GCA GCA ATT AAA AAG GCA TAA 



The amino acid 
Indicated above 



sequence of the Lhc^ protein 
the coding sequence, 



is 
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Lbc2 with, the sequence : 



GLY 
G/6GT 
180 

ALA PHE THR GLU LYS GLN GLU ALA LEO VAL SER SER SER PHE GLU ALA PHE LYS ALA ASK 
GCT TTC ACT GAG AAG CAA GAG GCT TTG GTG AGT AGC TCA TTC GAA GCA TTC AAG GCA AAC 

210 240 

ILE PRO GLN TYR SER VAL VAL PRE TYR THR SER 

ATT CCT CAA TAC AGC GTT GTG TTC TAC ACT TC/GTA AGT TTT CTC TTA AAG CAT GTA TCT 

270 300 

5 TTC ATT CTC TGT TTT TCC TTT CGA CAT TTT TTG TGT TTG AAA AGA GAT AGT GTC AAT GTG 

330 360 

ILE LEU GLU LYS ALA PRO ALA ALA LYS 

AGT GGG TAT TTT TTT TTA TTA AAA ATT AAC AG/G ATA CTG GAG AAA GCA CCC GCA GCA AAG 

390 420 

ASP LEU PHE SER PHE LEU SER ASM GLY VAL ASP PRO SER ASN PRO LYS LEU THR GLY HIS 
GAC TTG TTC TCG TTT CTA TCT AAT GGA GTA GAT CCT AGT AAT CCT AAG CTC ACG GGC CAT 

4SO 480 

ALA GLU LYS LEU PHE GLY LEU 

GCT GAA AAG CTT TTT GGA TTG/GTA AGT ATC ATC CAA CTA AAA TTA TAG CTA TTT TAT GTG 

SIO 540 

lO A TT AAT TTT AAG ATT AAA CAT GTA TTT AAC ACT CTT AAA CAT GTA TTT AAC ACT CTT AAG 

570 600 

ATT AAA CAT GTA TTT AAC TAA AAC ATG TAT TTG CTG ATT ATT TTT TTT TTA TAA TTA TCT 

630 660 

VAL ARG ASP SER ALA GLY GLN LEU LYS ALA 
TGT CAC ATA TTA TAT ATT TTT TGA ATT GTA G/GTG CGT GAC TCA GCT GGT CAA CTT AAA GCA 

690 720 

ASN GLY THR VAL VAL ALA ASP ALA ALA LEU GLY SER ILE HIS ALA GLN LYS ALA ILE THR 
AAT GGA ACA GTA GTG GCT GAT GCC GCA CTT GGT TCT ATC CAT GCC CAA AAA GCA ATC ACT 

750 780 

1 e ASP PRO GLN PHE VAL 

-LJ GAT CCT CAG TTC GTG/GT ATG ATA AAT AAT AAA ATG TTA CAA TAA ATG CAC ATA TAC TTA 

810 840 

AAT TTT ACA TGG TGC AGT GTT ATG ATC ATC ATT TTT GTT TAG TAA TGA ATT TAC TTA AAA 

870 900 

TCT TAA ATT ATG TAC TTT TTG AAA GTT TTA TAT GGA ATT TTA ATT ATA GGG AAA AAT GTA 

930 960 

, ^ VAL VAL LYS GLU ALA LEU LEU LYS THR 

AGA GCT AAT CCA TTA GTG ATG TTT TGT CTG TAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA 

990 1020 

LE LYS GLU ALA VAL GLY ASP LYS TRP SER ASP GLU LEU SER SER ALA TRP GLU VAL ALA 
ATA AAG GAG GCA GTT GGG GAC AAA TGG AGT GAT GAA TTG AGC AGT GCT TGG GAA GTA GCC 

1050 1080 



20 T ? R ASP GLU LEU ALA ALA ALA ILE LYS LYS ALA PHE 

TAT GAT GAA TTG GCA GCA GCT ATT AAG AAG GCA TTT TAC 

1110 
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The amino acid sequence of the Lbc2 protein is 
indicated above the coding sequence, 

and LDC3 with the sequence: 

GLY ALA PHE THR ASP 
G/GGT GCT TTC ACT GAT 

* 120 

LYS GLN GLU ALA LEU VAL SER SER SER PHE GLU ALA PHE LYS THR ASN ILE PRO GLN TYR 
* AAG CAA GAG GCT TTG GTG AGT AGC TCA TTT GAA GCA TTC AAG ACA AAC ATT CCT CAA TAC 
^ ISO ISO 

SER VAL VAL PHE TYR THR SER 

AGT GTT GTG TTC TAC ACC TC/GTA AGT ATT CTA TCT AAA TTA TGT GTC TTA TTG TAT GTT 

210 240 

TAA CTT TCG TGG TTT GTT GTG TTT GAA AAA AAG ATA TAT ATT GTT AAT GTG AGT GGT TTT 

270 300 

ILE LEU GLU LYS ALA PRO VAL ALA LYS ASP LEU PHE SER 
GGT TTG ACT AAA AAT GAA TAG/G ATA CTG GAG AAA GCA CCT GTA GCA AAG GAC TTG TTC TCA 

330 360 

PHE LEO ALA ASN GLY VAL ASP PRO THR ASN PRO LYS LEU THR GLY HIS ALA GLU LYS LEU 
TTT CTA GCT AAT GGA GTA GAC CCC ACT AAT CCT AAG CTC ACG GGC CAT GCT GAA AAA CTT 

390 420 

PHE GLY LEU 

TTT GGA TTG/GT AAG TAT CCA GCC TAC TAA AAT TAA AAT CCT ATT AGT ATT TTT TAT TAT 

450 480 

VAL ARG ASP SER 

TTT TCT TCC ATG ATT GTC TTG TCA CAT ATT ATA TAT TTT TTG AAT TAT AG/GTA CGT GAT TCA 

510 540 

ALA GLY GLN LEU LYS ALA SER GLY THR VAL VAL ILE ASP ALA ALA LEU GLY SER ILE HIS 
GCT GGT CAA CTT AAA GCA AGT GGA ACA GTG GTG ATT GAT GCC GCA CTT GGT TCT ATC CAT 

570 600 

15 ALA GLN LYS ALA ILE THR ASF PRO GLN PHE VAL 

GCC CAA AAA GCA ATC ACT GAT CCT CAA TTT GTG/G TAT GAT AAA TAA TGA AAA GCT ACA 

630 66O 

ATA AAT GCA CAA ATA CTT AAT TTT ACA TAG TGC AGT GCT ATA TGA TCA TCA CTT TTG CTT 

690 720 

AGT AAT GAA TTT ACT TTT TTT TTT TAC AGA AGT AAT GGA TTT ACT TAA AAT CTT AAA TTA 

750 780 

TGT ACT TCT TTA AAG AGT TTT GTA TGG AAT TTT AAT TAT AGG AAA AAT GTA AGA GCT AAA 

810 840 

VAL VAL LYS GLU ALA LEU LEU LYS THR ILE LYS GLU ALA 
CCA TTG CTG ATG ATT TCG AAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA ATA AAG GAG GCA 

870 90O 

20 VAL GLY ASP LYS TRP SER ASP GLU LEU SER SER ALA TRP GLU VAL ALA TYR ASP GLU LEU 
GTT GGG GAC AAA TGG AGT GAC GAG TTG AGC AGT GCT TGG GAA GTA GCC TAT GAT GAA TTG 

930 , 960 

ALA ALA ALA HE LYS LYS ALA PHE 
GCA GCA GCT ATT AAG AAG GCA TTT TAG 
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The amino acid sequence of the 1*003 protein is 
indicated above the coding sequence. 

The present invention furthermore deals with a 
novel DNA fragment comprising an inducible plant 

5 promoter to be used when carrying out the method 
according to the invention, said DNA fragment being 
characterised by being identical with, derived 
from or comprising a 5' flanking region of root 
nodule -specific genes. Examples of such DNA 

10 fragments are DNA fragments being identical with, 
derived from or comprising a 5' flanking region of 
plant leghemoglobin genes. Preferred examples are 
according to the invention DNA fragments being 
identical with, derived from or comprising a 5' 

15 flanking region of the four soybean leghemoglobin 
genes , viz . : 

Lba with the sequence: 



GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 
GATATATACC TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 

>q TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 
ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 
GGTTAAATCT CAT AGTG CCT CTATTCAATA ATTTGGGCTC AATTTAATTA 
GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 
AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 
TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 

25 CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTGAT 
TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 
TCATCATGCT GATTGACACC CTCCACAAGC CAAG AGAAAC ACATAAGCTT 
TGGTTTTCTC ACTCTCCAAG CCCTC TATAT AAA CAAATAT TGGAGTGAAG 
TTGTTG C ATA ACTTGCATCG AACAATTAAT AGAAATAACA G A A A ATT AAA 
AAAGAAATAT G, 
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Lbc^ with the sequence 



TTC7CTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 
CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TAC A A AG GAG 
ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 
TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 
TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
GAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 
TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAGGATTTTG AAAAGATCAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 
CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
AT ATA A AC AT GTATTGGATG TGAAGTTATT GCATAACTTG CATTGAACAA 
TAGAAAATAA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG, 



Lbc2 with the sequence: 



TCGAGTTTTT ACTGAACATA CATTTATTAA AAA AAAC TCT CTAGTGTCCAi 
TTTATTCGGC GAGAAGCCTT CTCGTGCTTT ACACACTTTA ATATT AJTAT ! 
ATCCCCACCC CCACCAAAAA AAAAAAAACT GTTATATCTT TCCAGTACAT ! 
TTATTTCTTA TTTTTACAAA GGAAACTTCA CGAAAGTAAT TAC A A AAA AG ! 
ATAGTGAACA TCATTTTTTT AGTTAAGATG AATTTTAAAA TCACACTTTT: 
TTATATTTTT TTGTTACCCT TTTCATTATT GGGTGAAATC TCATAGTGAA-" 
ACTATTAAAT AGTTTGGGCT CAAGTTTTAT TAGTAAAGTC TGCATGAAAT. 
TTAACTTAAT AATAGAGAGA GTTTTGGAAA GGTAACGAAT GTTAGAAAGT. 
GTGATATTAT TATAGTTTTA TTTAGATTAA TAATTATGTT TACATGAAAA 
TTGACAATTT ATTTTTAAAA TTCAGAGTAA TACTTAAATT ACT T ATT TAC 
TTTAAGATTT TGAAAAGATC ATTTGGCTCT TCATCATGCC GATTGACACC 
"CTCCACAAGC C A A G AG AAAC TTAAGTTGTA ATTTTTCTAA CTCCAAGCCT 
TC TATATAAA CACGTATTGG ATGTGAAGTT GTTGCATAAC TTGCATTGAA 
CAATAGAAAT AACAACAAAG AAAATAAGTG AAAAAAGAAA TATG , 



and Lbe3 with the sequence: 



0249676 



32 



TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATT.-iCm.Arv 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AG AG CC ATT T 
5 TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAA AG GG ACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA AC A AG CCA AG AGAGACATAA 
lO GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTCTATA TAAATAAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG* 



Another example of a preferred DNA fragment accord- 
ing to the invention is a DNA fragment which is 
15 identical with, derived from or comprises 5 r flank- 
ing regions of the Lb c 3 - 5 ' - 3 ' CAT gene with the 
sequence 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

20 ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 

25 CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAAG TA 

CAGAAAAGTA GAATTCTAAA ATG 



30 Still another example af such a DNA fragment ac- 
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cording to the invention is a DNA fragment which 
is identical with, derived from or comprises 5' 
flanking regions of the N23 gene with the sequence 



10 20 30 40 50 60 70 

GAATTCGAGCTCGCCCGGGGATCGATCCTCTAGAGTCGACCTCCAGCC 
EooRI Sail 

80 90 lOO 110 120 130 14 O 

5 TTCTATTGAGACACGATTTGAACAATTTTTACATT^ 

150 160 170 180 19Q 2O0 210 

AAATTTAAAGCTTTAGATGATGATGAATTGA^ 

220 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATNTATTNNCAG^TTGAA^ 

290 300 310 320 330 340 350 

10 AGAAGTTAGCACACCAATAGAAGTATTGAGTTATATTAAA^ 

360 370 3SO 390 4O0 410 420 

C^TATAGAATTTTATTGACAATCCTTATAAC^GTTGCTACTGTTGAAAGACGTTCTTCAAAATTAAAA 

430 440 450 460 470 480 490 

ACTTAAATCATATCTAAAATCAAXZAATGTTACAAGATAGATTGAATGAGTTA 

500 510 520 530 540 550 560 

-| q ACTAAAGTGTTAGAATT G TTTGATTATAAAACTCTGATAAAT 

570 580 590 600 610 620 630 

TAATATAAAAATTGATATTTTATATAATATATTAACTCTCTTTAAAATTCTTGTAA 

640 650 660 670 680 690 700 

AAATAATAAAATAAAGCAACTCTTAATTTTAATGAAACATCCCTTTGTTAAACCGAAT^ 

7lO 720 730 740 750 760 770 

2q AAAAATTAATGCTTGATGGAAGTTTTTAATTTGTTCTACT 

780 790 800 810 820 830 840 

TATC^TTTATATGTTGTAAATATGAATGCACTAGTAATTAGTTTAATGATAAAATATATTCTACAGATAT 

850 860 870 880 690 900 910 

ATTTCTGTCTCTTGGCAACTCGTGAGAATTGAATATATTATA^ 

920 930 940 950 960 970 - 980 

2 5 AGAATAAATATTTATATACAATTCCTAGATTTTGTTATAAAATT 

990 1000 1010 1020 1030 1040 1050 

GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATTAGCGGCTAGCTAAGTAACCAAGTA 

Ddel 

ATTAATG 



0249676 



34 

The invention relates furthermore to any plasmid 
to be. used when carrying out the method according 
to the invention and characterised by comprising a 
DNA fragment containing an inducible plant promoter 
5 as herein defined. Particular examples of suitable 
plasmids according to the invention are pARll, 
pAR2 9, pAR30, and N23-CAT, cf . Examples 3, 4, and 
11. These plasmids allow recombination into the A . 
rhizogenes T DNA region, 

lO The invention relates furthermore to any Agrobac- 
terium strain to be used in connection with the 
invention and characterised by comprising a DNA 
fragment comprising an inducible plant promoter of 
root nodule- specif ic genes built into the T DNA 

25 region and therefore capable of transforming the 
inducible promoter into plants. Particular examples 
of bacterium strains according to the invention are 
the A. rhizogenes strains AR1127 carrying pAR29 , 
AR1134 carrying pAR30 a AR1000 carrying pARll, and 

20 AR204-N23-CAT carrying N23-CAT. 

It is obvious that the patent protection of the 
present invention is not limited by the embodiments 
stated above . 

Thus the invention employs not exclusively 5' flan- 
25 king regions of soybean leghemoglobin genes. It is 
well-known that the leghemoglobin genes of all 
leguminous plants have the same, function , cf. Apple- 
by (1974) in The Biology of Nitrogen Fixation, 
Quispel. A. Ed. North-Holland Publishing Company, 
3 0 Amsterdam, Oxford, pages 499-554, and concerning the 
kidney bean PvLbl gene it has furthermore been 
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proved that a high degree of homogoly exists with 
the sequences of the soybean LDC3 gene. It is also 
known that the expression of other root nodule- 
specific genes is regulated in a similar manner 
5 like the leghemoglob in genes . The invention includes 
thus the use of 5' flanking regions of leghemoglob in 
genes or other root nodule - spec if ic genes of all 
plants in case the use of such DNA fragments makes 
the expression of a desired gene product the subject 
lOmatter of the regulation characterised by the pre- 
sent invention. 

The present invention allows also the use of such 
fragments of any origin which under natural con- 
ditions exert or mediate the regulation charac- 
25terised by the present invention. The latter applies 
especially to such fragments which can be isolated 
from DNA fragments from gene libraries or genomes 
through hybridization with labelled sequences of 5' 
flanking regions of soybean leghemoglobin genes. 

2Qlt is well-known that it is possible to alter nuc- 
leotide sequences of non- important sub-regions of 
5' flanking regions without causing an alteration 
of the promoter activity and the regulation. It is 
also well-known that an alteration of sequences of 

25 important subregions of 5' flanking regions renders 
it possible to alter the binding affinities between 
nucleotide sequences and the factors or effector 
substances necessary or responsible for the trans- 
cription initation and the translation initiation 

3 Q and consequently to improve the promoter activity 
and/or the regulation. The present invention in- 
cludes, of course, also the use of DNA fragments 
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containing such altered sequences of 5 ' flanking 
regions , and in particular DNA fragments can be 
mentioned which have been produced by recombining 
sequences of 5' flanking regions of any gene with 
5 5' flanking regions of root nodule - specif ic genes 
provided the use of such DNA fragments subjects 
the expression of a desired gene product to the 
regulation characterised by the present invention. 

It should be noted that the transformation of micro- 
10 organisms is carried out in a manner known per se, 
cf . e.g. Maniatis et al . , (1982), Kolecular Cloning, 
A Laboratory Manual, Cold Spring Harbor Laboratory. 

The transformation of plant cells, i.e. introduction 
of plasmid DNA into plant cells , is also carried 
15 out in a manner known per se, cf. Zambryski et 
al., (1983), EMBO J. 2, 2143-2150. 

Cleavage with restriction endonucleases and di- 
gestion with other DNA modifying enzymes are well- 
known techniques and are carried out as recommended 
20 by the suppliers. 

The A grobacterium rhizogenes 15834 rif^ was used 
as a typical representative of A. rhizogenes: see 
White et al . , I.Bact., Vol. 141 (1980), 1134-1141. 

Example 1 

25 Sequence determination of 5' flanking regions of 
soybean le ghemo globin genes 

From a soybean gene library the four soybean leg- 
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hemoglobin genes . Lba, Lbc lf Lbc 2 , arid Lbc 3 are 
provided as described by Jensen, E.0. et al. , Nature 
Vol. 291, No. 3817, 677-679 (1981). The genetically 
stable in-bred invariable soybean species "Glycine 
5 max. var. Evans" was used as a starting material for 
the isolation of the DNA used for the construction 
of said gene library. The 5' flanking regions of 
the four soybean leghemoglob in genes are isolated, 
as described by Jensen, E.0., Ph D Thesis, Institut 
10 for Molekylar Biologi, Arhus Universitet (1985), 
and the DNA sequences determined by the use of the 
dideoxy method as described by Sanger, F., J. Mol. 
Bio. 143, 161-178 (1980) and indicated in the se- 
quence scheme . 

Example 2 

Construction of Lbc 3 - 5 9 - 3 ' ~CAT 

The construction has been carried out in a sequence 
of process steps as described below: 

a) Sub-clo ^inp; the Lbc^ gene 

2o The Lbc 3 gene was isolated on a 12Kb EcoRI restric- 
tion fragment from a soybean DNA library, which 
has been described by Wiborg et al . , in. Nucl. Acids 
Res. (1982) 10, 3487. A section of the fragment is 
shown at the top of the attached Scheme 2. This 

25 fragment was digested by the enzymes stated and 
then ligated to pBR322 as indicated at the Scheme. 
The resulting plasmids Lbc 3 HH and Lbc 3 HX were sub- 
sequently digested by PvuII and religated, which 
result d in two plasmids called pLpHH and pLpHX. 



38 



0249676 



b) Sub-cloning 5 'flanking sequences from the Lbc g 

gene 

For this purpose pLpHH was used as shown in the 

attached Scheme 3. This plasmid was opened by means 

5 of PvuII and treated with exonuclease Bal31. The 

reaction was stopped at various times and the 

shortened plasmids were ligated into fragments from 

pBR322. These fragments had been treated in advance 

as shown in Scheme 3, in such a manner that in one 

jjq end they had a DNA sequence ^ TC 

AAG 

* 



After the ligation a digestion with EcoRI took 
place, and the fragments containing 5' flanking 
sequences were ligated into EcoRI digested pBR322. 

15 These plasmids were transformed into E. col i K803. 
and the plasmids in the transf ormants were tested 
by sequence analysis. A plasmid, p213 5'Lb, isolated 
from one of the transf ormants , contained a 5 r flan- 
king sequence terminating 7 bp before the Lb ATG 

20 start codon in such a manner that the sequence is 
as follows : 

2Kb 

-5 ' flanking AAAGTAGAATTC 

Lbc3 sequence 

25 E. coli K803 is a typical representative of the E. 
coli K12 recipient strains. 

c) Sub-cloning 3' flanking reg ion of the Lbc^ 
gene 
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For this purpose pLpHX was used which was digested 
by XhoII. The ends were partially filled out and 
excess single-stranded DNA was removed with SI 
nuclease, as shown in the attached Scheme 4. The 
5 fragment shown was ligated into pBR322 which had 
been pretreated as shown in the Scheme. The con- 
struction was transformed into E. coll K803 . One 
of the trans formants contained a plasmid called 
Xho2a-3'Lb. As the XhoII recognition sequence is 
lO positioned immediately after the Lb stop codon, cf . 
Scheme 2, the plasmid contained about 900 bp of 
the 3' flanking region, and the sequence started 
with GAATTCTACAA . 

The construction of Lb promoter cassette 

15 An EcoRI/SphI fragment from Xho2a-3'Lb was mixed 
with a BamHI /EcoRI fragment from p213-5'Lb. These 
two fragments were, ligated via the BamHI/SphI cleav- 
age sites into a pBR322 derivative where the EcoRI 
recognition sequence had been removed, cf . Scheme 

20 4 . The ligated plasmids were transformed into E . 
coli K803 . A plasmid in one of the transf ormants 
contained the correct fragments, and it was called 
pEJLb 5 ' - 3' -1. 

Construction of the Lbc 3 5'3 f -CAT gene 

25 The CAT gene of pBR322 was isolated on several 
smaller restriction fragments, as shown in the 
attached Scheme 5. The 5' coding region was isolated 
as an Alul fragment which was subsequently ligated 
into pBR322, treated as stated in the Scheme. This 



0249676 



40 

was transformed into E. coli K803 . Several trans - 
formants contained the correct plasmid. One was 
taken, out and called Alull. The 3' coding region 
was isolated on a Taql fragment. This fragment was 
5 treated with exonuclease Bal31, whereafter EcoRI 
linkers were added. Then followed a digestion with 
EcoRI and a ligation to EcoRI digested pBR322. The 
latter was transformed into E . coli K803 and the 
transf ormants were analysed. A plasmid, Taq 12, 

10 contained the 3 r coding region of the CAT gene 
plus 23 bp 3 ' flanking sequences subsequently term- 
inating in the following sequence CCCCGAATTC. 
Subsequently the following fragments were ligated 
together to EcoRI digested 

15 pEJLb5' -3 ' -1: EcoRI/FvuII fragment from AluT , 
PvuII/Ddel fragment from pBR322 and Ddel/EcoRI 
fragment from Taq 12. This ligation mixture was 
transformed into E, coli K803 . Several trans formants 
contained the correct plasmid. One was taken out 

2Qand was called pEJLb 5' -3' CAT 15 . 

Example 3 
a . 

Cloning and integration of the soybean Lbc 3 *5'-3 f - 
CAT gene . 

25 Two EcoRI fragments (No. 36 and No. 40) of the T L - 
DNA region of A. rhizogenes 15834 pRi plasmid was 
used as "Integration sites". Thus the Lbc3~5 r -3- 
CAT gene was subcloned (as 3,6 Kb BamHI/Sall frag- 
ment) Into two vectors pARl and pAR22 carrying the 

30above EcoRI fragments. The resulting plasmids pAR29 
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and pAR30 were separately mobilized into A . rhi - 
zogenes 15834 rif R using a plasmid helper system; 
see E. van Haute et al . (1983), EMBO J. 3, 411- 
417. Neither pAR29 nor pAR30 can replicate in Agro- 
5 bacterium. Therefore the selection by means of 
rifampicin 100 /ig/ml and the plasmid markers spec- 
tinomycine 100 pg/ml , strep tomycine 100 /*g/ml or 
kanamycine 300 pg/ml will select A. rhizogenes 
bacteria having integrated the plasmids via homo- 

10 logous recombination through the EcoRI fragments 
36 or 40. The structure of the resulting T L -DNA 
regions - transferred to the transformed plant 
lines L5-9 and L6-23 - has been indicated at the 
bottom of the attached Scheme 6. In this Scheme is 

15 furthermore for the L6-23 line shown the EcoRI and 
Hindlll fragments carrying the Lbc3 - 5 ' - 3 ' - CAT gene 
and therefore hybridizing to radioactively labelled 
Lbc3»5 ' -3 ' -CAT DNA used as a probe, cf. Example 
4a. 

20 b.*. 

Cloning and integration of the soybean Lbc 3 gene. 

The EcoRI fragment No. 40 has here been used as 
"integration site". The Lbc3 gene was therefore 
sub-cloned (as a 3,6 Kb BamHI fragment into the 

25pARl vector and transferred into the T^-DNA region 
as stated in a. The structure of the T^-DNA region, 
transferred to the transformed plant line L8-35, 
has been shown at the bottom of the attached Scheme 
7. This Scheme furthermore shows the EcoRI and 

30HindIII fragments carrying the Lbc3 gene and there- 
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fore hybridizing with radioac tively labelled Lbc3 
DNA used as a probe, cf. Example 4b^. 
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Example 4 . 



a . 



DAnmTififcraMnTi of th * ««vWn Lbc i -V-V-fiAT gene in 
transformed plan ts of bird's-foot trefoil...^ 
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DNA extracted from transformed lines (L6-23) or 
untrans formed control plants and cleaved by the 
restriction enzymes EcoRI and Hindlll was analyzed 
by Southern-hybridization. Radioac tively labelled 
5 Lb c 3 - 5 r - 3 ' - CAT gene was used as a probe for demon- 
strating corresponding sequences in the transformed 
lines. The bands marked with numbers correspond to 
restriction fragments constituting parts of the 
Lb c 3 - 5 ' - 3 ' - CAT gene as stated in the restriction 
lO map (Scheme 6) of Example 3a. 

b . 

Demonstration of the soybean Lbc g gene of trans- 
formed plants of bird f s-foot trefoil. 
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DNA extracted from transformed lines (L8-35) or 
untransformed control plants and cleaved by the 
restriction enzymes EcoRI and Hindlll was analyzed 
by Southern-hybridization. Radioactive Lbc 3 gene 
5 was used as a probe for detecting corresponding 
sequences in the transformed lines. The bands marked 
with numbers correspond to restriction fragments 
constituting parts of the Lbc 3 gene as stated in 
the restriction map (Sch me 7) of Example 3b. 
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Example 5 
a . 

Expression of the Lb c 3 - 5 ' - 3 ' - CAT gene in v arious 
tissues of bird' s - f oot trefoil. 



1 2 3 4 5 6 

Untransformed L6-23 

R N LS R N LS 



8 \9 ,10 11 12 

L5 - 9 ./ Lbc 3 transformed 

N LS R N LS 



3Ac Cm 
1 Ac Cm 

Cm 
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The activity of the chlor oamphenicol acetyl trans- 
ferase (CAT) enzyme is. measured as the amount of 
acetylated chloroampheni col (AcCm) produced from 
^C-chloroamphenicol . In (a) the acetylated forms 
5 lAcCm and 3AcCm appear, which have been separated 
from Cm through thin-layer chromatography in chloro- 
f orm/methanol (95:5). The columns 1-3 show that no 
CAT activity occurs in root (R) , nodule (N) , as 
well as leaves + stem (LS) of untransf ormed plants 
lO of bird's-foot trefoil. The columns 4-6 and 7-9 
show the CAT activity in corresponding tissues of 
Lbc 3 - 5 ' - 3 ' -CAT transformed L6-23 and L5 - 9 plants. 
The conversion of chlqroamphenicol in columns 5 
and 8 shows the organ-specific expression of the 
15 Lbc3 - 5 ' - 3 ' - CAT gene in root nodules. The columns 
10-12 show the lack of CAT activity in plants trans- 
formed with the Lbc3 gene , 



b 



Table 



20 



Root 
Nodule 
Leaves + 
25 Stem 



L6-23 
CAT activity 
0 

68830 cpir/yg protein- h 



0 



L5-9 
CAT activity 
0 

154,000 cpnv/ug protein -h 



In the Table (b) the CAT activity in Lbc 3 - 5 ' - 3 ' - CAT 
transformed L5 - 9 and L6-23 plants has been stated 
as the amount of ^C-chloroaraphenicol converted 
into acetylated derivatives. The amount of radio- 
30 activity in the acetylated derivatives has been 
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counted by liquid scintillation and stated in cpm/^g 
protein * hour. 

Example 6 

Transcription test (North ern analysis^ on tissues. 
5 of Lb c 3 * 5 r - 3 ' - CAT transformed and Lbc 3 transformed 
Lotus plant lines . 



3 




5 6 
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8 9 10 
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b) 
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5 /ig of total RNA extracted from root (R) , nodule 
(N) or leaves + stem (LS) and separated in formal- 
dehyde agarose gels were transferred onto nitro- 
cellulose. Column 1 contains 5 Mg of total RNA from 
5 20 -day- old soybean nodules as control plants. The 
columns 2^-4 and 5-7 contain total RNA from root, 
nodule or leaves + stem, respectively, of the LDC3- 
5' -3' -CAT transformed lines L5-9 and L6-23. The 
columns 8-10 contain RNA from corresponding tissues 

10 of bird's-foot trefoil transformed by means of A . 
rhizogenes carrying the I/DC3 gene in the T^-DNA. 
In (a) radioactive DNA of the CAT coding sequence 
has been used as a probe for hybridization. The 
organ- specif ic transcription of the Lbc3-5'-3 # - 

15 CAT gene in root nodules from the L5-9 and L6-23 
lines appears from columns 3 and 6. In (b) the 
transcript for the constitutive ubiquitine gene(s) 
is visualized using a cDNA probe for the human 
ubiquitine gene for the hybridization. In (c) the 

20 nodule -specific transcription of bird' s- foot trefoil 
own leghemoglob in genes is shown. A cDNA probe of 
the Lba gene of soybean has been used for this 
hybridization . 



0249676 



50 



Example 7 



Determination of the transcript ion initiation site 
(CAP site 1 ) of the LDC 3 promoter of soybean in trans- 
formed root nodules of bird's-foot trefoil. 
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The position of the "CAP sit-e" was determined on 
the nucleotide level by means of primer extension. 
A synthetic oligonucleotide 5 ' CAACGGTGGTATATCCAGTG3 ' 
complementary to the nucleotides 15-34 in the coding 
5 sequence of the CAT gene was used as primer for 
the enzyme reverse transcriptase. As a result sin- 
gle-stranded cDNA was formed the length of which 
corresponds to the distance between the 5' end of 
the primer and the 5' end of the primed mRNA. A 83 

Iq nucleotide cDNA strand would be expected according 
to the knowledge of the transcription initiation 
site of soybean Lbc3 gene. Columns 2, 3, and 4 
from left to right show the produced DNA strands 
when the primer extension has been operated on 

15 polyA + -pur if ied mRNA from transformed root nodules 
of bird' s-foot trefoil, transformed leaves + stem 
of bird' s-foot trefoil, and untrans formed root 
nodules of bird's-foot trefoil, respectively. Tha 
85 , 86, 87, 88, and 90 nucleotides long cDNA strand 

2o shown in column 2 proved correctly I/DC3 promoter 
function in bird' s-foot trefoil. The CAP sites 
corresponding to the cDNA sequences generated are 
indicated with asterisks (*) on the partial se- 
quence of the Lbc3 5 ' 3 ' - CAT region given. In the 

25 sequence the TATA box of the Lbc3 promoter and the 
corresponding translation initiation codon of the 
CAT coding sequence are underlined. 
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Example 8 



Demonstration of the correct developmental control 
of the Lbc 3 - 5 * - 3 ' - CAT gene in transformed plants of 
bird's-foot trefoil (L6-23). 



in 



3 



CAT activity 

in cpm/^g protein-hour 0 
Nitrogenase activity 
nmol ethylene//xg protein 0 
- hour 
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5g 



342.3 



0.5 



4-t 
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I 
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in ca 

43 iff 
ca h£ 



1255' 



2.7 



* Substrate limited reaction; actual activity about 
68000 cpm/j&g protein ■ hour. 



Chloramphenicol acetyl transferase and nitrogenase 
activity were measured on cut off pieces of root 
with nodules at the different developmental stages 
indicated. The CAT activity can be detected in the 
white distinct nodules whereas the nitrogenase 
activity did not appear until the small pink nodules 
have developed. The latter development corresponds 
to the development known from soybean control plants 
and described by Marcker et al . EMB0 J. 1984, 3, 
1691*95. The CAT activity was determined as in 
Example 5. The nitrogenase activity was measured 
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as acetylene 
followed by 
ethylene. 

Example 9 




53 

reduction capacity of the nodules 
^.chromatographic determination of 



Proteins extracted from root nodules of Lbc 3 trans- 
formed (L8-35), Lbc 3 -5' -3' -CAT transformed and 
nontransformed plants were separated by isolectric 

10 focussing at a P H gradient of 4 to 5. The columns 
1, 3, 5, 7, and 9 show Lbc^ Lbc 2 , Lbc 3 , and Lba 
proteins synthesized in soybean control root nod- 
ules. Column 2 shows proteins from root nodules of 
Lbc 3 -5' - 3' - CAT transf ormed L6 - 23 -bird' s - foot trefoil 

!5 plants, whereas the columns 6 and 8 show proteins 
from nontransformed plants. The columns 4 and 10 
show soybean Lbc 3 protein synthesized in root nod- 
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ules of bird's-foot trefoil plants (L8-35) trans- 
formed with the Lbc3 gene. The LDC3 protein band 
is indicated by an arrow. 

Example 10 

5 Expression of the Lbc 3 - 5 ' - 3 ' - CAT gene re quires the 
5 f Lbc 3 promoter region. 

The Lb c 3 - 5 ' - 3 ' - CAT gene construction carries a 2 Kb - 
5' Lbc3 promoter region. Stepwise removal of se- 
quences from the 5' end of this region demonstrated 
IO that this promoter region is required for the char- 
acteristic expression of the Lbc 3 - 5 ' 3 ' - CAT gene. 



5'Lbc, 



3'Lbc, 



Sail 
I 



Xbal 
2 Kb- 



s> all 



The Lbc3~5 9 -3 r -CAT gene construction was opened in 
15 the unique Xbal site shown above, and digested with 
the exonuclease Bal31. A Sail linker fragment was 
ligated onto the blunt ends generated and the short- 
ened Sail fragments carrying the Lbc3 - 5 ' - 3 ' - CAT gene 
were transferred into L . eorniculatus . The effect 
2o of removing promoter sequences was measured as CAT 
activity. End points of the deleted 5 r region are 
given as the distance from the CAP site in nucleo- 
tides . 
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5'Lbc 



2000 



-950 r 

-230 
-78 



3'Lbc 



1 CAT- 
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-J — ■ — 
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CAT activity 
Cpm/^g protein/hrs 



Root 


Nodule 


Leaf 


0 


80000 


0 


0 


10000 


0 


0 


3000 


0 


0 


3000 


0 


0 


0 


0 



5 The drastically reduced level of CAT activity ex- 
pressed from the Lbc3 promoter deleted to nucleotide 
-230 and the zero activity from the promoter deleted 
to nucleotide -78 demonstrates that the Lbc3 pro- 
moter region is required for the root nodule spe - 
lO cific expression of the Lbc3 - 5 ' - 3 ' - CAT gene. 



Example 11 



Construction of the N23-CAT gene . 



The N23 gene was isolated from a soybean DNA library 
as described in the enclosed paper of Sandal, Bojsen 

15 and Marcker . The N23-CAT gene was constructed from 
the modified Lbc3 - 5 ' - 3 ' - CAT gene carried on plasmid 
pEJ5 ' - 3 ' -CAT101 as described in the Applicant's 
copending application No. 86 11 4704.9 concerning 
"Expression of Genes in Yeast", and a 1 Kb. EcoRI, 

2o^ de l fragment containing the N23 5' promoter region. 
The position of the EcoRI and Ddel sites in the 
N23 promoter region is indicated on the DNA sequence 
shown below. The cloning procedure used is outlined 
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below. The disclosure of the papers of Sandal et 
al. , the EF application, and the paper of Jensen 
et al-, Nature 321 (12 June 1986), 669-674, includ- 
ing the references cited should be considered in- 
5 corporated into the present description as a means 
to amend, illustrate, and clarify it. 

The N23-CAT gene was transferred to plants by the 
same method as the Lbc 3 - 5 ' - 3 ' - CAT gene. 
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DNA sequence of the 5 '-promoter region from the 
N23 gene 



lO 20 3Q 40 50 60 70 

GAATTC G&GCTCGCCCt^GGATCGATCCTCTAGA GTCGAC CT^ 
EooRI Sail 

80 90 lOO HO 120 130 140 

5 TTCTATTGAGACACGATTTGAACAATTTTTACATTATGAGACTAT^ 

ISO 160 170 ISO 190 200 210 

AAATTTAAAGCTTTAGATGATGATGAATTGAAimAATATTGTArr^ 

220 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATOTATTNNCAGAATTGAAAGTATTAAGAGAAGTG 

290 300 310 320 330 340 3SO 

JjQ AGAAGTTAGCACACCAATAGAAGTATTGAGTTATATTAAAACTTTAGAOT 

360 370 380 390 400 410 420 

CATATAGAATTTTATTGACAATCCTTATAACAGTTGCTACT^ 

430 440 450 460 470 480 490 

ACTTAAATCATATCTAAAATGAACAAXGTTACAAGATAG^ 

500 510 S20 530 540 550 SCO 

15 AGTAAAGTGTTAGAATTGTTTGATTATAAAACTCTGATAAATG^ 

570 580 590 600 610 620 630 

TAATATAAAAATTG&TATTTTATAT&ATATATTA^ 

640 650 660 670 6SO 690 700 

AAATAATAAAATAAAGCAACTCTTAATTTTAATGAAACATCCCTTTGTTAAACCGA^ 

710 720 730 740 750 760 770 

20 AAAA&TTAATGCTTGATGGAAGTTTraAAT 

780 790 800 810 820 830 840 

TATCATTTATATGTltaTAAArATGAATGCACTAGTA 

850 860 870 880 890 9O0 910 

ATTTCTGTCTCTTGGCAACTCGTGAGAATTGAATATATTATAAAGATGAAAGCT 

920 930 940 950 960 970 9BO 

25 AGAAT AAATATTTATATACAATTCCTAGATTTT GT TATAAAATTCACAT ATTGT ATGAGT AT AAAT ACAT 

990 100O 1010 1020 1030 1040 1050 

GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAAO^ATTAGCGGCTAGCTAAGTAACCAAGTA 

Ddel 

ATTAATG 
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Example 12 

Organ- specif lc expression of the soybean N2 3 - CAT 
gene in root nodules of L. corniculatus and Tri folium 
repens . 

¥ 

5The activity of chloramphenicol acetyl transferase 
(CAT) was measured as in example 5 and is given in 
cpm//xg protein/hrs . 



Table a. CAT activity 

N23-CAT transformed Untrans formed 

L. corniculatus L . corn iculatus 



lO 



15 



Root nodule 86150 0 

Root 0 0 

Table b . CAT activity 

N23 - CAT transformed Untransf ormed 

T . repens T . repens 

Root nodule 148000 0 

Root 0 0 



Table (a) and b) shows the organ- specif ic expression 
of the N2 3 - CAT. gene in root nodules of L . cornicu- 
20 latus and T » repens . L . corniculatus was inoculated 
with Rhizobium loti . while T . repens was inoculated 
with Rhizobium tr if olii ■ 

In connection with the invention it has thus been 
proved that root nodule - specif ic genes can be ex- 
25 pressed organ- specifically upon transfer to other 
plants, h re Lotus corniculatus and Trifolium re - 
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pens . It has furthermore been proved that the 5' 
flanking regions comprising the promoter are con- 
trolled by the organ- specif ic regulatory mechanism 
as the organ- specif ic control of the Lbc3 - 5 ' - 3 ' - CAT 
g gene in Lotus corniculatus took place at the trans- 
cription level. The Lbc3 - 5 r - 3 ' -CAT gene transferred 
was thus only transcribed in root nodules of trans- 
formed plants and not in other organs such as roots, 
stems , and leaves . 

lO ^ ne expression of the Lbc3~5 ' -3 ' -CAT gene in root 
nodules of transformed plants also followed the 
developmental timing known from soybean root nod- 
ules. No CAT activity could be detected in roots 
or small white root nodules (Example 8) „ A low 
activity was present in the further developed white 
distinct nodules, whereas a high activity could be 
measured in the small pink nodules and mature nod- 
ules developed later on . 

The organ-specific expression and the correct de- 
20 velopmental expression of transferred root nodule- 
specific genes, here exemplified by the Lbc3-5'-3'- 
CAT gene , allows as a particular use a functional 
expression of root nodule- specif ic genes also in 
other plants beyond leguminous plants, wlien all 

25the root nodule- specif ic plant genes necessary for 
the formation of root nodules are transferred from 
a leguminous plant to a non- root -nodule - forming 
plant species, the correct organ- specif ic expres- 
sion proved above allows production of functionally 

30active, nitrogen- fixing root nodules on this plant 
upon infection by Rhizobium . In this manner these 
plants can grow without the supply of external 
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inorganic or organic nitrogen compounds. Root nod- 
ule-specific promoters, here exemplified by the 
Lbc 3 and N23 promoters, must be used in the present 
case for regulating the expression of the trans- 
5 f erred genes. 

According to the present invention a root nodule- 
specific promoter is used for expressing genes. 
The gene product or function of the gene product 
improves the function of the root nodule, e.g. by 
10 altering the oxygen transport, the metabolism, the 
nitrogen fixation or the nitrogen absorption. 

Root nodules are thus used for the synthesis of 
biological products improving the plant per se or 
which can be extracted from the plant later on. A 
15 root nodule-specific promoter can be used for ex- 
pressing a gene. The gene product or compound formed 
by said gene product constitute the desired pro- 
duct ( s ) . 

In connection with the present invention it has 
20 furthermore been proved that the soybean Lbc 3 leg- 
hemoglobin protein per se, i.e. the Lbc 3 gene pro- 
duct, is present in a high concentration in root 
nodules of bird's-foot trefoil plants expressing 
the Lbc 3 code sequence under the control of the 
25 Lbc 3 promoter. The latter has been proved by cloning 
the genomic Lbc 3 gene of the soybean into the in- 
tegration vector pARl, said genomic Lbc 3 gene con- 
taining the coding sequence, the intervening se- 
quences, and the 5' and 3' flanking sequences . A 
30 3.6 Kb BamHI fragment Lbc 3 HH, cf. Example 2, was 
cloned into the pARl plasmid and transferred to 
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bird's-foot trefoil as stated previously. 

The high level of LDC3 protein, cf. Example 9, 
found in transformed root nodules of bird's-foot 
trefoil and corresponding to the level in soybean 
5 root nodules proves an efficient transcription of 
the Lbc3 promoter and an efficient processing and 
translation of Lbc 3 mRNA in bird's-foot trefoil. 

The high level of the CAT activity present in trans- 
formed root nodules is also a result of an efficient 

IO translation of mRNA formed from the chimeric Lbc3 
gene. The leader sequence on the Lb 03 gene is de- 
cisive for the translation initiation and must 
determine the final translation efficiency. This 
efficiency is of importance for an efficient syn- 

15 thesis of gene products in plants or plant cells. 
An Lbc 3 or another leghemoglob in leader sequence 
can thus be used for increasing the final expression 
level of a predetermined plant promoter. The con- 
struction of a DNA fragment comprising a Lb leader 
2Qsequence as first sequence and an arbitrary promoter 
as second sequence is a particular use of the in- 
vention when the construction is transferred and 
expressed in plants. 

During nodule development around 30 different plant 
25 encoded polypeptides (nodulins) are specifically 
synthesized. Apart from the leghemoglob ins , nod- 
ulins include nodule - specif ic forms of uricase 
(Bergmann et al (1983) EMBO . J. 2, 2333-2339), 
glutamine synthetase (Cullimore et al (1984) J.Mol. 
3QAppl. Genetics 2, 589-599) and sucrose synthase 
(ttorell and Copeland (1985) Plant. Physiol. 78, 
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149-154). The function of most nodulins are, how- 
ever, at present unknown. 

Many nodulln genes have nevertheless been Isolated 
and characterised during the last five years. These 
5 include nodulins from several different legumes. 
Examples of such isolations and characterisations 
are widespread in the literature such as (Fuller et 
al (1983) Proc. Natl. Acad.Sci. 80, 2594-2598), 
(Sengupta-Gopalan et al (1986) Molec. Gen. Genet. 

10 203, 410-420), (Bisseling et al (1985) in Proceed- 
ings of the 6th Int, symp . on Nitrogen Fixation, 
Martinus Nijhoff Publishers pp 53-59.), and (Geb- 
hardt et al (1986) EMBO . J . 5 , 1429-1435). All of 
these genes contain nodule - specif ic regulatory 

15 sequences. Such sequences and in fact entire 5' 
flanking regions and 3 f flanking regions can fur- 
thermore be synthesized by automated oligonucleotide 
synthesis knowing the DNA sequences for the Lbc3 
and N23 genes given in this description. Entire 

20 nodule-specific genes can also be isolated with 
known recombinant techniques as described in the 
above papers and by (Maniatis et al (1982) Mole- 
cular cloning. A Laboratory Manual, Cold Spring 
Harbour " Laboratory , New York). 

25 The described method to obtain nodule - spec if ic 
expression of genes can thus be reconstructed and 
performed according to the invention by any one 
skilled in the art of molecular genetics. 

The method to obtain nodule - specif ic expression is 
30 not dependent on the A. rhizogenes plant transforma- 
tion described. Any other plant transformation 
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system e.g. A. tumefaciens systems, direct gene 
transfer or microinjection can equally be applied. 

The A. rhizogenes system has been used and charac- 
terised by a number of scientific groups and is 
5 thus well-known from the literature. The character- 
istics of the system is described in: 

Willmitzer et al . (1982), Molec.Gen. 
Genet. 186, 16-22, 

Chilton et al. (1982), Nature 295, 432-434, 

lO Simpson et al . (1986), Plant . Molec . Biol . 

6, 493-415, 

Tepfer D. (1983), Molecular Genetics of 
the Bacteria - Plant interaction, 

Springer Verlag, Berlin Heidelberg pp 
15 248-258, 

White and Nester (1980), J.Bact. 144, 
710-720, 

Jaynes and Strohel (1981), Int. Rev. of Cytol. 
Sup. 13, 105-125, 

20 White and Nester (1980), J. Bact. 141, 

1134-1141, 

Pomponi et al . (1983), Plasmid 10, 119- 
12 9, and 
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Slightom et al . (1986), J. Biol. Chera. 
261, 108-121. 

The latter two publications describe the restriction 
map and nucleotide sequence of the A. rhizoeenes 
5TL-DNA segment used in the transformation system de- 
scribed here. With this information it is possible 
to anybody skilled in molecular genetics to use 
and reconstruct the "intermediate vectors" and the 
A. rhizogenes strains described here. 
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Claims : 

1. A method of expressing genes in plants, parts 
of plants, and plant cell cultures by introducing 
into a cell thereof a recombinant DNA segment con- 

5 taining bath the gene to be expressed and a 5' 
flanking region comprising a promoter sequence, 
and optionally a 3' flanking region, and culturing 
of the transformed cells in a growth medium, 
characterised by using as ° the recom- 
lO binant DNA segment a DNA fragment comprising an 
inducible plant promoter (as defined) from root 
nodule - specif ic genes. 

2. A method as claimed in claim 1, char- 
acterised by using a DNA fragment com- 

15 prising an inducible plant promoter (as defined) 
and being identical with, derived from or comprising 
5' flanking regions of root nodule -specif ic genes. 

3. A method as claimed in claim 2, char- 
acterised by using a DNA fragment com- 

2o prising an inducible plant promoter (as defined) 
and being identical with, derived from or comprising 
5' flanking regions of root nodule - specif ic genes, 
said DNA fragment causing an expression of a gene 
which is induced in root nodules at specific stages 

25 of development and as a step of the symbiosis, 
whereby nitrogen fixation occurs . 

4. A method as claimed in claims 1-3 for the 
expression of root nodule - specif ic genes, 
characterised by using a DNA fragment 

30 comprising an inducible plant prom ter (as defined) 
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from root nodule - specif ic genes. 

5. a method as claimed in claims 1-3 for the 

expression of genes in leguminous plants, parts of 
leguminous plants, and leguminous plant cell cul- 
5 tures, characterised by using a DNA 
fragment comprising an inducible plant promoter 
(as defined) from root nodule - spec if ic genes. 



6. A method as claimed in claims 1-5, char- 
acterised by the DNA fragment comprising 

10 the inducible plant promo ter and being identical 
with, derived from.or comprising 5' flanking regions 
of leghemoglobin genes. 

7. A method as claimed in claim 6, char- 
acterised by the DNA fragment comprising 

15 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of soybean leghemoglobin genes . 

8 . A method as claimed in claim 7 , char- 
acterised by the DNA fragment comprising 

20 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lb a gene with the sequence 

GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 
GATATATACC TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 

25 TCTTTTATTT TT AT AAA AAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 
ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 
GGTTAAATCT CATAGTGCCT CTATTCAATA ATTTGGGCTC AATTTAATTA 
GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 
AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 
TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 
CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATT7GA7 
TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCG7 TGTTTCTTCT 
TCATCATGCT GATTGACACC CTCCACAAGC" CA AG AG A A AC ACATAAGCTT 

30 TGGTTTTCTC ACTCTCCAAG CCCTCTATAT AAACAAATAT TGGAG.TGAAG 
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TTGTTGCATA ACTTGCAT CG AACAATTAAT AGAAA7AACA G AAA ATT AAA 
AAAGAAATAT G- 

9. A method as claimed in claim 7, char- 

acterised by the DNA fragment comprising 
5 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lbc^ gene with the sequence: 

TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 

10 CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TACAAAGGAG 
ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 
TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTT CAT 
TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
G A A AG G GAG C GAATG T T AAA AAGTGTGATA TTATATTTTA TTTCGATTAA 

15 TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAGGATTTTG AAA AG AT CAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 
C AC A AG CCA A GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
AT AT A A AC AT GTATTGGATG T G A AGTTATT GCATAACTTG CATTGAACAA 
TAGAAAATAA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG, 



20 10* A niethod as claimed in claim 7, char- 
acterised by the DNA fragment comprising 
the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lbc2 gene with the sequence: 

25 TCGAGTTTTT . ACTGAACATA CATTTATTAA AAAAAACTCT CTAGTGTCCA 
TTTATTCGGC GAGAAGCCTT CTCGTGCTTT ACACACTTTA ATATTATTAT 
ATCCCCACCC CCACCAAAAA AAAAAAAACT GTTATATCTT TCCAGTACAT 
TTATTTCTTA TTTTTACAAA GGAAACTTCA CGAA AG T A AT TACAAAAAAG 
ATAGTGAACA TCATTTTTTT AGTTAAGATG AATTTTAAAA TCACACTTTT 
TTATATTTTT TTGTTACCCT TTTCATTATT GGGTGAAATC TCATAGTGAA 
ACTATTAAAT AGTTTGGGCT CAAGTTTTAT TAGTAAAGTC TGCATGAAAT 
TTAACTTAAT AATAGAGAGA GTTTTGGAAA GGTAACGAAT GTTAGAAAGT 

30 GTGATATTAT TATAGTTTTA TTTAGATTAA TAATTATGTT TACATGAAAA 
TTGACAATTT ATTTTTAAAA TTCAGAGTAA TACTTAAATT ACT T ATTTAC 
TTTAAGATTT TGAAAAGATC ATTTG GCTCT TC AT CAT G C C GATTGACACC 
CTCCACAAGC CAAGAGAAAC TTAAGTTGTA ATTTTTCTAA CTCCAAGCCT 
TC TATATAAA CACGTATTGG ATGTGAAGTT GTTGCATAAC TTGCATTGAA 
CAATAGAAAT AACAACAAAG AAAATAAGTG AAAAAAGAAA TATG, 
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11. A method as claimed in claim 7, char- 
acterised, by the DNA fragment comprising 
the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
5 of the Lbc3 gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAA TT AC AAA 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

lO AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TT A TT AG TAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 

15 TTCACCATAC CAATTGATCA CCCTCCTCCA AC A AG CCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTCTATA TAAATAA GTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG- 

12. A method as claimed in claim 7, c h a r a c- 
20 ter *- sed by the DNA fragment compr is ing the 
inducible plant promoter and being identical with, 
derived from or comprising 5 ' flanking regions of 
the Lbc3 - 5 ' - 3 ' - CAT gene with the sequence; 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 

25 GTACTATTTA AGAAAAGAAA . AAAAAAACCT GCTACATAAT TTCCAATCTT 

GTAGAT.TTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 

TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 

TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 

30 CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 

AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 

GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 

CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 

TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 

TTCACCATAC CAATTGATCA CCCTCCTCCA A CA AG CCAAG AGAGACATAA 



GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAA GTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAATTCTAAA ATG 
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13. A method as claimed In claim 5, charac- 
terised by the DNA. fragment comprising the 
inducible plant promoter and being identical with, 
derived from or comprising 5' flanking regions of 
5 the H23 gene with the sequence: 



lO 20 30 40 50 60 70 

GAATTCGAGCTCGCCCGGGGATCOaTCCTCTAGACT 
EcoRI SaOX 

80 90 lOO 1XO 120 130 140 

TTCT ATT GAGACACGATTTGAACAATTTTT ACATT ATGAGACT ATM 

ISO 160 170 180 190 200 _21Q 

lO AAATTTAAAGCTTTAGATGATGATGAJ^TTGAAN^ 

220 230 240 2SO 260 270 260 

ATGAATGCTATGATATTGATGGTCTTGATOTATTNNCAGAATTGAAACTAT^ 

290 300 310 320 330 340 3SO 

AGAAGTTAGCACACCAATAGAAGTAT*TGAGTTAIAT^ 

360 370 3SO 390 400 410 420 

15 CATATAGAATTTTATTGACAATCCTTATAACAGTTGCT^ 

430 440 450 460 470 480 490 

ACTTAAATCATATCTAAAATCAACAATGTTACAAGATAG^ 

500 5XO 520 530 540 550 560 

ACT AAAGTGTT AGAATTGTTTGATT ATAAAACTCTGAT AAAT GAITTT 

570 SSO 590 600 6lO 620 630 

2Q TAATATAAAAATTGATATTTTATATAATATATTAAGTCTC^ 

640 6 SO 660 670 6 BO 690 700 

AAATAATAAAATAAAGCAACTCTTAATTTTAATGAAACATCCCTTTGTTAA^ 

710 720 730 740 7SO 760 770 

AAAAATTAATGCTTGATGGAAGTTTTTAATTTGTTCTACTCAATACTCAAAG 

„ 780 790 BOO 8IO 820 830 840 

25 TATCATTTATATGTrCTAAATATGAATGCACTAGTAAXTAGTTT 

850 860 870 880 890 900 9lO 

ATTTCTGTCTCTTGGCAACTCCTGAGAATTGAATATATTATAAAGATGAAAGCT 

920 930 940 950 960 970 980 

AGAAT AAATATTTATATACAATTCCT AGATTTTGTTATAAAATTCACAT A GT AT GAGTATAAATACAT 

990 IOOO 1010 1020 1030 1040 1050 

30 GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATTAGCGGCT^ 

rbel 



ATTAATG 
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14. A method as claimed in any of the claims 
1-13, characterised by the 3' flanking 
region of the genes to be expressed being a 3' 
flanking region of root nodule - sp ecific genes of 

5 any origin . 

15. A method as claimed in claim 14, char- 
acterised by the 3' flanking region being 
of leghemoglobin genes. 

16. A method as claimed in claim 14, c h a r - 
lO^cterised by the 3' flanking region being 

of soybean leghemoglobin genes . 

17. A method as claimed in claim 16, char- 
acterised by the 3' flanking region being 
of the Lba, Lbc lf Lbc2 or Lbc 3 gene with the fol- 

15 lowing sequences, respectively: 



Lba 



1590 l«o 

TAA TTA GTA TCT ' ATT GCA GTA AAG TGT AAT AAA TAA ATC TTG 

16SO 1680 
20 *TT CAC TAT AAA ACT TGT TAC TAT TAG ACA AGG GCC TGA TAC AAA ATG TTG GTT AAA ATA 

1710 1740 
ATG GAA TTA TAT AGT ATT GGA TAA AAA TCT TAA GGT TAA TAT TCT ATA TTT GCG TAG GTT 

1770 1800 
TAT GCT TGT GAA TCA TTA TCG GTA TTT TTT TTC CTT TCT GAT AAT TAA TCG GTA AAT TA 

1830 1860 
25 ACA AAT AAG TTC AAA ATG ATT TAT ATG TTT CAA AAT TAT TTT AAC AGC AGG TAA AAT GTT 

ATT TGG TAC GAA AGC TAA TTC GTC GA 
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Lbci 



1320 

TAA/TT AGG ATC TAC TGC ATT GCC GTA 



1350 1380 
AAG TGT AAT AAA TAA ATC TTG TTT CAA CTA AAA CTT GTT ATT AAA CAA GTT CCC TAT ATA 

1410 14 40 

AAT GTT GTT TAA AAT AAG TAA ATT TCA TTG TAT TGG ATA AAC ACT TTT AAG TTA TAT ATT 

1470 1500 
TCC ATA TAT TTA CGT TTG TGA ATC ATA ATC GAT ACT TTA TAA AAA TAA ATT CCA AAT AAT 



TTA TAC GTT TTA AAA ATT ATT TT 



Lbc 



TAG/GAT CTA CTA TTG CCG TCA AGT 

Z140 

GTA ATA AAT AAA TTT TGT TTC ACT AAA ACT TGT TAT TAA ACA AGT CCC CGA TAT ATA AAT 

1170 1200 

GTT GGT TAA AAT AAG TAA ATT ATA CGG TAT TGA TAA ACA ATC TTA AGT TTT ATA TAT AGT 

1230 1260 

TCC ATA TAC TAA AGT TTG TGA ATC ATA ATC GA 

1290 



and Lb c 3 



TAG/GAT CTA CAA TTG CCT TAA AGT GTA ATA AAT AAA 
990 1020 

TAT TAT TTC ACT AAA ACT TGT TAT TAA ACC AAG TTC TCG ATA TAA ATG TTG GTT AAA CTA 

lOSO IO 80 

AGT AAA TTA TAT GGT ATT GGA TAA ACA ATC TTA AGC TT 

lllO 



18 . A method as claimed in claim 1 of preparing 
a polypeptide by introducing into a cell of a plant, 
a part of a plant or a plant cell culture a recombi- 
nant plasmid, characterised by using 
as the recombinant plasmid a plasmid comprising an 
inducible plant promoter (as defined) of root nod- 
ule-specific genes. 



* 
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19. A DNA fragment comprising an inducible plant 
promoter (as defined) to be used when carrying out 
the method as claimed in claims 1-18, char- 
acterised by being identical with, de - 

5 rived from or comprising a 5' flanking region of 
root nodule - specif ic genes of any origin. 

20. A DNA fragment as claimed in claim 19 , 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 

10 plant leghemoglob in genes. 

21. A DNA fragment as claimed in claim 20, 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 
soybean leghemo glob in genes. 

15 22. A DNA fragment as claimed in claim 21, 

characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lba gene with the sequence: 



GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 

20 GATATATACC TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 

TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 

ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 

GGTTAAATCT CAT AGTG CCT CTATTCAATA ATTTGGGCTC AATTTAATTA 

GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 

AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 

TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 

95 CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTGAT 

TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 

TCATCATGCT GATTGACACC CTCCACAAGC CAAG AGAAAC ACATAAGCTT 

TGGTTTTCTC ACTCTCCAAG CCCTC TATAT AAACAAATAT TGGAGTGAAG 

TTGTTG CAT A ACTTGCATCG AACAATTAAT AGAAATAACA G AAA ATT AAA 

AAAGAAATAT G , 



* 
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23- A DNA fragment as claimed In claim 21, 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lbc^ gene with the sequence: 

TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
5 AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 
CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TAG A A AG GAG 
ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 
TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 
TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
10 GAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 
TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATT C AG A AT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAG G ATTTTG AAA AG AT CAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 
CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
ATATAAA CAT GTATTGGATG TG A AG TTATT GCATAACTTG CATTGAACAA 
TAG A A A AT A A CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG , 



13 24. A DNA fragment as claimed in claim 21, 

characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lbc2 gene with the sequence: 



TCGAGTTTTT 
TTTATTCGGC 

20 ATCCCCACCC 
TTATTTCTTA 
ATAGTGAACA 
^FT AT AT T'TTT 
ACTATTAAAT 
TTAACTTAAT 
GTGATATTAT 
TTGACAATTT 

95 TTTAAGATTT 
CTCCACAAGC 
TC TATATAAA 

/vn X AOnru-i X 



ACTGAACATA 
GAG A AG CCTT 
CCACCAAAAA 
TTTTTACAAA 
TCATTTTTTT 
TTGTTACCCT 
AGTTTGGGCT 
AATAGAGAGA 
TATAGTTTTA 
ATTTTTAAAA 
TGAAAAGATC 
CAAGAGAAAC 
CACGTATTGG 
AACAACAAAG 



CATTTATTAA 
CTCGTGCTTT 
AAAAAAAACT 
GGAAACTTCA 
AGTTAAGATG 
TTTCATTATT 
CAAGTTTTAT 
GTTTTGGAAA 
TTTAGATTAA 
TTCAGAGTAA 
ATTTGGCTCT 
TTAAGTTGTA 
ATGTGAAGTT 
AAAATAAGTG 



AAAAAACTCT 
ACACACTTTA 
GTTATATCTT 
CGAAAGTAAT 
AATTTTAAAA 
GGGTGAAATC 
TAGTAAAGTC 
GGTAACGAAT 
TAATTATGTT 
TACTTAAATT 
TCATCATGCC 
ATTTTTCTAA 
GTTGCATAAC 
AAAAAAGAAA 



» rp ry*m mm ^ rr» 

iilni.ni. xri 1 

TCC AG T AC AT 
TACAAAAAAG 
TCACACTTTT 

rtl/ib x brtrt 

TG C ATG A A AT 
GTTAGAAAGT 
TACATGAAAA 
ACTTATTTAC 

Crp/"^/"* * ■» r** ri-i 

X w \~?\e\\i \_ W x 

TATG , 



25. A DNA fragment as claimed in claim 21, 
characterised by being Identical with, 
30 derived from r comprising a 5' flanking region of 



0249676 



75 

the Lbc3 gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TG C CAT A AG A ACCAACAAAA 
GTACTATTTA AG A A A AG AAA AAAAAAACCT G CT A CAT A AT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
5 ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
in CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG * 



15 26. A DNA fragment as claimed in claim 21, 

characterised by the DNA fragment 
comprising the inducible plant promoter being iden- 
tical with, derived from or comprising 5' flanking 
regions of Lbc 3 - 5 ' - 3 ' - CAT gene with the sequence: 

2Q TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 

GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 

GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 

TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 

TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 

25 CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 

AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 

GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 

CT AA.nAA/*.A X AX.niitl 1 -.-v-rt. All*- .-—-Lrt. x ^. LnG.tn x rvr* x .-v V- + _ .-v--*-rt. J. • i 

m i h ^ iii mou ^ r— TV**** » A £ i T*f" i r****—* r~ ^ ^ ^ p«inii i i ii wm ^i ^ ^ ^ /—» T*r- < 

TTrtTiTnLib r\A-r»-ri jl bnb x - Ly.rv J. - - .---.Vj ± * J. x * wrvrurt-rvCjx-v iuni-oiL.w 

TTCnV-Urtir.L ^ AA 1 x i. V-rt. LLw.«.«.*L\.rv AU.-%n(j v-\-j-.n.u rtunununi.m 

TTTTATinG Tlni. jlLiuai L-rwt- ^ x_ .^.-vA G^L - w «. x x n. - nV j x .-v 

TTuwMUiwn A*ji x x *j — * ^j\-n. iAALj, A >j\ M .nX L(jxvn^*vni ..n ni.n. uvwvrt x -ru-w 

2q CAGAAAAGTA GAATTCTAAA ATG 



27. A DNA fragment as claimed in claim 19, 
characterised by being identical with, 




# 
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derived from or comprising 5' flanking regions of 
Che N23 gene with the sequence: 



10 20 30 40 50 60 70 

GAATTCGAGCTCGCCCGGGGATCGATCCTCTAG^ 
EcoRI Sail 

80 90 lOO HO 120 130 140 

5 TTCTATTGAGACACGATTTGAACAA 

JU^TTTAAAGCTTTAGAXGATGAT 

Z20 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATNTATTNNCAGAATTG^ 

290 300 3lO 320 330 340 3SO 

lO AGAAGTTAGCACACCAATAGAAGTATTGAGTTATATTAAAACTTT^ 

360 370 380 390 400 410 420 

CATATAGAATTTTATTGACAATCCTTATAACAGTTGCTACT 

430 440 4 SO 460 470 480 490 

ACTTAAATCATATCTAAAATCAACAATGl^ACAAGATAG^ 

500 SIO 520 530 540 550 560 

15 AGTAAAGTGTTAGAATTGTTTGATTATAAAACTCTGATAAATGA^ 

570 580 590 600 610 620 630 

TAATATAAAAATTGAIIATTTTATATAATATATTAAGTCTCTTT^ 

640 650 660 670 680 690 700 

AAATAATAAAATAAAGCAACTCTTAATTTTAATGAAACATCCCT 

710 720 730 740 750 760 77Q 

20 AAAAATTAATGCTTGATGGAAGTTTTTAATTTGTTCrrACrCCAATACTCAA^ 

780 790 800 810 820 830 840 

TATCATTTATATGTTGTAAATATGAATGCACTAGTAATTAGT^ 

850 860 870 880 890 900 9lO 

ATTTCTGTCTCTTGGCAACTCGTGAGAAl^GAATATATTATAA^ 

920 930 940 9 SO 960 970 980 

25 AGAATAAATAOTTATATACAATTCCTAGATTTTGTTATAAAATTC^ 

990 1O0O lOlO 1020 1030 1040 1050 

GA(^ACACACrAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATTAGCGGCTAGC TAAG TAACC^AGTA 

Odel 

ATTAATG 



28. A plasmid which can be used when carrying 
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out the method as claimed in claims 1-18, 
characterised by comprising a DNA 
fragment as claimed in any of the claims 19-27. 

29. A plasmid as claimed in claim 28, c h a r- 
5acterised by being pAR2 9 . 

30. A plasmid as claimed in claim 28, c h a r- 
acterisedby being pAR30. 

31. A plasmid as claimed in claim 28, char- 
acterised by being pARll. 

10 32. A plasmid as claimed in claim 28, char- 
acterised by being N2 3 - CAT . 

33 . A trans f ormant Agrobac ter ium rhizo genes 15 8 34- 
strain which can be used when carrying out the 
method as claimed in any of the claims 1 to 18, 

15 characterised by the bacterium strain 
being transformed by a plasmid according to any of 
the preceding claims 28 to 32. 

34 . A trans f ormant Agrobac ter ium rhizogenes 15 8 34- 
strain which can be used when carrying out the 

20 metnod as claimed in any of the claims 1 to 18 , 

characterised by the bacterium strain 
being transformed by pAR29 and being named AR1127. 

3 5 . A trans f ormant Agrobac teriun t rhizogenes 15 8 34- 
strain which can be used when carrying out the 
25 method as claimed in any of the claims 1 to 18, 

characterised by the bacterium strain 
being transformed by pAR30 and being named AR1134. 
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36 . A transformant Agrobac ter ium rhizo genes 15834- 
strain which can be used when carrying out the 
method as claimed in any of the claims 1 to 18, 
characterised by the bacterium strain 

5 being transformed by pARll and being named AR1000. 

37 . A transformant Agr obacter ium rhizogenes 15834- 

strain which can be used when carrying out the 

method as claimed in any of the claims 1 to 18 , 

characterised by the bacterium strain 

lO being transformed by N23-CAT and being named AR204- 
N2 3 - CAT . 

38. Plants, parts of plants and plant cells, 
particularly of the family Leguminosae, obtainable 
by transformation* with a recombinant DNA segment, 
15 fragment or plasmid according to any one of the 
claims 1 to 37. 



